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S. PNEUMONIAE ANTIGENS 
The present invention relates to isolated nucleic acid molecules, which encode antigens for Streptococcus 

V ZZ 0n Z T S T b i e ^ * P^ 3 ^ <* pharmaceutical medicaments for the pTeveX 

and treatment of bacterial mf ections caused by Streptococcus pneumoniae. Prevention 

Streptoccccus . pnetnnoniae (Pneurnococcus) is a lancet-shaped, gram-positive, facultative anaerobic 
bacterium. It is only the encapsulated organism that is pathogenic for humans and experimental^^ 
Capsuh* are antigenic and form the basis for classifying pneumococci by serotype Ninety series 
have been identified, based on meir reaction with type-specific antisera. Most TpneumonZ sZZZ 

and^^ 

oTS^S ZTT T^Tt ° f ^ to ^ toa ^^ d ^y be isolated ftom me rtasopharynx 
of 5/o to 70 /o of normal adults Rates of asymptomatic carriage vary with age, environment, an^ 
presence of upper respu^atory infections. Only 5%-10% of adults without children are carriers LXols 

n? a nvT s ^^ f 58% ° f StUd T ^ rGSidente ^ ^ carriers. On n^taryTtaU^ot a 
many as 50% to 60% of service personnel may be Carriers. The duration of carriage varies and is generally 
longer in children than adults (reviewed in Pn^nm^i rt ^ , ^ . B vcu.i«>«uiui4,generauy 
Diseases, 7th Edition-Second Prmt^e^Bo^ ^ ^ ° f Vaccine-Preventable 

The relationship of carriage to the development of natural immunity is poorlv understood In adHiti™ 
the immunologic mechanism that allows disease to occur in a carrier Is po^y ' 

Streptococcus pneumoniae is an important agent of human disease at the extremities of age and in those 
who ^e underlying disease. Pneumococcal disease kills more people - in the US 40,000 oTmo^et A 
year - than aU other vaccine preventable diseases combined. The major dUbd sylTm^of 
pneumococcal disease include pneumonia, bacteremia, and meningitis. The disease most^ften occurs 
when a predisposing condition exists, particularly pulmoruuyTisease. It is a ^on^acSTa 
complication of antecedent viral respiratory infection such as influenza and measles a^dofcSo" 

ZldnTandlu 1^°™ Z 5 *"* 1 ™ ^^^'d^^cor^veheartfeil^^S^ 
smoking and alcohohsm. Pneumococcal infections are more common during the winter and in eariy 
sprmg when respiratory diseases are more prevalent. Immunodeficiency (splenic dysfun^on iatiogtf 
^^To%^ --eased" bactS 

S pneumoniae is responsible for 88% of bacteremia infections in the US. Pneumonia is the most common 

Sudani 5of ofToXt 15a ° 0W ™° 0 ^ P« year (US). 36% of adult 

acquired and 50% of hospital-acquired pneumonia is caused by S. pneumoniae (US) The incidence of 
disease among adults aged 65 years and older has been reported to ^^^7100000 r^ feSin 
rates for this disease increase from 1.4% for those aged two or younger to Z ST^SE" ^ n *£Z 
aged 80 or older. Diseases caused by influenza J ^coc^^« t^f^ ^ 

^^^^ 

PreventableDiseases, 7th Edition-Second St^S^^"^ " * 
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m children, Pneumococci are a common cause of acute otitis media, and are detected in 28%-55% of 

H ^^ PirateS ; By 386 12 m ° nthS ' 62% ° £ have ^ at le ** one episode'f ac^tf otitis 

media. Middle ejections are the most frequent reasons for pediatric office visits in the UntSi Stated 
resulting m over 20 million visits annually. Complications of pneumococcal otitis media CtaSS 
mastoiditis and meningitis. Bacteremia without a known site of infection is the most comm^ in^Sve 

C ^J T t T * yCarS ° f ag6 ' aCCOUntin S for aPP-ximately 7oTof SvSve 

™ 2^ ag i Sr ° UP ' P neumonia accounts 12%-16% of invasive pneumococcal disease 

ST2 8 ^ ft ^ ^ *■ d6Cline ° f dtoM ^ S - F^momoe has become tte 

tTa^IvTZhthTf ^ * ^ ° f ^ e m * e United States. Children <1 

Cburd^of^ of Pneumococcal meningitis, approximately 10 cases per 100 ,000 population. 

The burden of pneumococcal disease among children <5 years of age is significant. An estimated 17 000 
cases of invasive dxsease occur each year, of which 13,000 are bacteremTwithout a^n lite of 
infection and about 700 are of meningitis. An estimated 200 children die every year as a L^tTmvIsive 
oSd^TeX ^Although not considered invasive disease, an estimated 5 million t£ 7ac^e 
ofabs medaa occur each year among children <5 years of age (reviewed Epidemiology and Prevention^ 
Vaccme-Preventable Diseases, 7th Edition-Second Printing, The Pink Book). revention of 

A definitive diagnosis of infection with Streptococcus pneumoniae generally relies on isolation of the 
organism from blood or other normally sterile body sites. Tests afe also avauableTo de^ cap u^r 
polysaccharide antigen in body fluids. capsular 

P^dllin is the drug of choice for treatment. However, successful implementation of anti-infective 
oTS 7 - C ° me T easkl ^ b — * of widespread antimicrobial resistance T^i^Z 

Penicillin is rising, and according to recent reports it reaches ~ 25% in the US {Whitney, C eTaf^OOoT 

IrSy oS?^ SttainS reaChGd - 20 % T - 64 * ^aTtimliZ 

W^I ZZ^r *" mCreaSe * reSiStanCe ° f * — to - la ^ -d macrolides 

However, even with effective antibiotic therapy (sensitive strains), the case fatality rate of invasive 
dxsease xs hxgh wxth an average of 10% in the developed world and can be much WgT« wUh 
serotypes, m elderly patients and in cases of bacteremia or meningitis (up to 80%). 

rva^ctr^r^ 3 T* 311 6ffeCtiVe treatment t0 P reVent OT borate spneumoococcal infections 
A vaccine could not only prevent infections by streptococci, but more specifically prevent or ameUoraTe 
colonization of host tissues (esp. in nasopharynx), thereby reducing the incidence 7^1*^2™ 
mictions and other suppurative infections, such otitis media. Elimination oftS^7 
" bac f« - d and sepsis - would be a direct cor^equence T^uZ me 

° f 3CUte ^ff 011 ^ Carriage ° f ^ ° r S anisrn - Va «*«* capable of shewing cros^prote^tion 
agamst the majority of S. P ne«« strains causing human infections would also beTseS to HenTor 
ameliorate infections caused by all other streptococcal species, namely groups A, B, C and G. * 

A vaccine can contain a whole variety of different antigens. Examples of antigens are who1e-ki11«H n 
attenuated organisms, subfractions of these organisms/tissues, proteins, or ^STos^mp^form 
Peptide!' * ^ T f ° ^ reCOgni2ed * ~ e ^m in form' of gtycos7aLZiin? or 
e ^totoZ T c :il^CTr C .r m P ° lySaCCharideS °* ^ Short peptide! can be useHut^fo 
example cytotoxic T-cells (CTL) recognize antigens in form of short usually 8-11 amino acids Ion* 
peptides in conjunction with major histocompatibility complex (MHC) B-celircan^L^r,- 8 

orTrootrTus^^^ 
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encounter the antigen(s) followed by presentation of processed or unmodified antigen to immune effector 
cells. Intermediate cell types may also be involved. Only effector cells with the appropriate specificity are 
activated in a productive immune response. The adjuvant may also locally retain antigens and co-injected 
other factors. In addition the adjuvant may act as a chemoattractant for other immune cells or may act 
locally and/or systemically as a stimulating agent for the immune system. 

Efforts to develop effective pneumococcal vaccines began as early as 1911. However, with the advent of 
penicillin in the 1940s, interest in the vaccine declined, until it was observed that many patients still died 
despite antibiotic treatment. By the late 60s, efforts were again being made to develop a polyvalent 
vaccine The first pneumococcal vaccines contained purified capsular polysaccharide antigen from 14 
different types of pneumococcal bacteria. In 1983, a 23-valent polysaccharide vaccine (PPV23) was 
licensed and replaced the 14- valent vaccine, which is no longer produced. PPV23 contains 
polysaccharide antigen from 23 types of pneumococcal bacteria which cause 88% of bacteremic 

<T * addi ?° n ' OOSS - reactivit y occurs fo ' ^eral capsular types which account for an 
additional 8 A of bacteremic disease. Two polysaccharide vaccines are available in the United States 
(Pneumovax 23, Merck, and Pnu-Immune 23, Wyeth-Lederle). Both vaccines contain 25 ug of each 
antigen per dose and include either phenol or thimerosal as a preservative. 

The first pneumococcal conjugate vaccine (PCV7, Prevnar) was licensed in the United States in 2000 It 
includes purified capsular polysaccharide of 7 serotypes of S. pneumoniae (4, 9V, 14, 19F, 23F, 18C, and 6B) 
conjugated to a nontoxic variant of diphtheria toxin known as CRM197. The serotypes included in 
Prevnar accounted for 86% of bacteremia, 83% of meningitis, and 65% of acute otftb media among 
children <6 years of age in the United States during 1978-1994 (reviewed in Epidemiology and Prevention 
of Vaccine-Preventable Diseases, 7th Edition-Second Printing, The Pink Book). Additional pneumococcal 
polysaccharide conjugate vaccines containing 9 and 11 serotypes of S. pneumoniae are being developed 
The vaccine is administered intramuscularly. After 4 doses of Prevnar vaccine, virtually all healthy 
infants develop antibody to all 7 serotypes contained in the vaccine. Prevnar has also been shown to be 
immunogenic m infants and children, including those with sickle cell disease and HIV infection. In a 
large clinical trial, Prevnar was shown to reduce invasive disease caused by vaccine serotypes, and 

^ e H m p aS1Ve uT< 011,86(1 by ^ Ser0typeS ' indudin S serot yP es not » *e vaccine. ChilXen who 
received Prevnar had fewer episodes of acute otitis media and underwent fewer tympanostomy tube 
placements than unvaccinated children. The duration of protection following PrZar is cuie^tiy 
unknown. Immunization with Prevnar reduces the rate of nasopharyngeal carriage of the vaccine 
serotypes, while the overall carriage rate is unaffected. Unfortunate^,^ also bJfahown to tato 

Sp^ 

ro^Z^ood^ 6 " reC ° mmend f 1 to be administered routinely to i., all children as part of the 
ZTJ^U "nmunization schedule, ii., adults 65 years of age and older and iii., persons aged >2 

years with normal unmune systems who have chronic illnesses, including cardiovascular disease 
K^^' diab6teS ' ^ or cerebrospinal fluid leaks"* the elderly poputaTon 
tile target groups for pneumococcal vaccine and influenza vaccine overlap. These vaccines can be given at 
the same time at different sites without increased side effects. s can oe given at 

High mortality is observed among high-risk individuals (with underlying disease - mainly viral 
espiratory infection, immunocompromise) even with effective antibiotic Zapy. The mTb approS 
Sacten^ ™\**™**ease and provides immediate immune enhancement for the deaSnTe 

ty^zJ^t baCteria *** ^ WitWn P 1 -^ ceUs ™* in the blood 

by antibiotics^ This mechanism of action can help to eliminate the release of toxins (such as pneumolysin 
and other cytotoxms), which worsen the clinical condition of septic patients. Recent adva^Tui Z 
technology of monoclonal antibody production provide the means to genera* fhumaTan^y" agenl 
and reintroduce antibody therapies, while avoiding the tc^dties'assocUtea^^ Spy 
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ta^unoglobulins are an extremely versatile class of antimicrobial proteins that can be used to prevent 
and treat emerging infectious diseases. Antibody therapy has been effective against a variety ofcuverTl 
microorganisms reviewed in {Burnie, J. et al v 1998}. V cuverse 

^^7^ andb0dieS ^ ^ 8h0Wn t0 be ^ P^tective, it remains unclear what 

Ser ° t yP e -. S P edfic bodies protect against disease and more recently it hi 

XtlSST* - 1 ^ ° f ^ «" »*- - ™» ^-rminan- of 

Protein conjugate vaccines are no doubt a great new addition to the amarmatorium in the battle against 
pneumococca disease, but the vaccine contains a limited number of pneumococcal sTro^efa^d ^ 
adequate ecological pressure, replacement disease by non-vaccine^erotypes remams^ re" t£eT 
particularly m areas with very high disease burden. remains a real threat, 

During the last decade the immunogenic^ and protective capacity of several pneumococcal proteins 
have been described in animal models and these are now beingLplored for the deveZm^f sped's 
common protein based vaccines. Such proteins are the Pneumococcal surface protein A (PsvA 

Hr 9961? S'b^ H ' 6t 2 ° 03,) ' Pn — adhesin A gZ?v£££ 
v. et alv 1996}), Choline binding protein A (CbpA, (Rosenow C et al 1QQ71. jJr ~i • -5 

pro ems are involved m direct interactions with host tissues or in conceiling the bacterid ^face from Z 
host defense mechanisms {Navarre, W. et al., 1999). S. P neuinoniae is no! an ex^orTS Zs re^d 
Several surface proteins are characterized by as virulence factors, important for pneumococcal 

protSn^h^t * M " 2 ° 01) - * * *ese proteins could X Tette 

protechon to humans, they could provide the source of a novel, protein-based pneumococcal v^cLe to 

soT f I "f 1 ° r ^ Pl3Ce ° f ** m ° re ^onal capLar polysaciaiTv^e of 

i a ur: ^ , ' A " et ^ UU2 i resulted mainly from a selection based on easinp** of 

^srxLtr"' — * a demand ,o a ^ 



The present inventors have developed a method for idendfieaBon, isolation and production of 
patho^c funcdon^»en^™™? ) Sft S™ Ver ' "^T^ " bi< " 08iCal 

ca^Ls of S tLZST 5 ? * Carriage ° f * P ~ mflC - A percentage of young children are 
4— To ^^^ST^S^^^ 1 « 

S n carriage state couldn t be associated with antibody levels. Second, serum samples from 
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convalescent phase patients with invasive pneumococcal diseases were characterized and selected in the 
same way. The third group of sera, containing longitudinally collected samples were also obtained from 
individuals with invasive disease and were used mainly for validation purposes. The main value of this 
collection is that one can follow the changes in antigen-specific antibody levels before diase (prae-), at the 
time of onset (acute) and during recovery (convalescent). This latter group helps in the selection of 
epitopes, which induce antibodies during disease and missing in the prae-disease state. 

The genomes of the two bacterial species S. pneumoniae and S. aureus by itself show a number of 
important differences. The genome of S. pneumoniae contains app. 2.16 Mb, while S. aureus harbours 2 85 
Mb. They have an average GC content of 39.7 and 33%, respectively and approximately 30 to 45% of the 
encoded genes are not shared between the two pathogens. In addition, the two bacterial species require 
different growth conditions and media for propagation. While S. pneumoniae is a strictly human 
pathogen, S. aureus can also be found infecting a range of warm-blooded animals. A list of the most 
important diseases, which can be inflicted by the two pathogens is presented below. S. aureus causes 
mainly nosocomial, opportunistic infections: impetigo, folliculitis, abscesses, boils, infected lacerations, 
endocarditis, meningitis, septic arthritis, pneumonia, osteomyelitis, scalded skin syndrome (SSS), toxic 
shock syndrome. S. pneumoniae causes mainly community aquired infections: upper (pharyngitis, otitis 
media) and and lower respiratory infections (pneumonia), as well as bacteremia, sepsis and meningitis. 

The complete genome sequence of a capsular serotype 4 isolate of S. pneumoniae, designated TIGR4 was 
determined by the random shotgun sequencing strategy (GenBank accession number AE005672; see 
www.tigx |.org/tigrscripts/CMR2/CMRHomePage.spl). This clinical isolate was taken from the blood of a 
30-year-old male patient in Kongsvinger, Norway, and is highly invasive and virulent in a mouse model 
of infection. 



The problem underlying the present invention was to provide means for the development of 
medicaments such as vaccines against S. pneumoniae infection. More particularly, the problem was to 
provide an efficient, relevant and comprehensive set of nucleic acid molecules or hyperimmune serum 
reactive antigens from S. pneumoniae that can be used for the manufacture of said medicaments. 

Therefore, the present invention provides an isolated nucleic acid molecule encoding a hyperimmune 
serum reactive antigen or a fragment thereof comprising a nucleic acid sequence, which is selected from 
the group consisting of : 

a) a nucleic acid molecule having at least 70% sequence identity to a nucleic acid molecule selected 
from Seq ID No 1, 101-144. 

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a), 

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a) 
or b) ' 

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic 
acid molecule of a), b), or c) 

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid molecule defined in a), b), c) or d). 

According to a preferred embodiment of the present invention the sequence identity is at least 80%, 
preferably at least 95%, especially 100%. ' 

Furthermore, the present invention provides an isolated nucleic acid molecule encoding a hyperimmune 
cons^tm^Tr anti§en ° r 3 fragment ^ xeoi uprising a nucleic acid sequence selected from the group 
a) a nucleic add molecule having at least 96% sequence identity to a nudeic add molecule selected 
79, 81 94 q <£l00 * ^ ^ ^ ^ ^ * ^ ^ 55 " 62 ' * 67 ' 71 " 76 ' 78 " 
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b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a), 

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a) 
orb) ' 

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic 
acid molecule of a), b) or c), 

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid defined in a), b), c) or d). 

According to another aspect, the present invention provides an isolated nucleic acid molecule comprising 
a nucleic acid sequence selected from the group consisting of 

a) a nucleic acid molecule selected from Seq ID No 9, 17, 24, 32, 37, 43, 52, 54, 65-66, 70, 80. 

b) a nucleic acid molecule which is complementary to the nucleic acid of a) 

c) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid defined in a), b), c) or d). 

Preferably, the nucleic acid molecule is DNA or RNA. 

According to a preferred embodiment of the present invention, the nucleic acid molecule is isolated from 
a genomic DNA, especially from a S. pneumoniae genomic DNA. 

According to the present invention a vector comprising a nucleic acid molecule according to any of the 
present invention is provided. 

In a preferred embodiment the vector is adapted for recombinant expression of the hyperimmune serum 
reactive antigens or fragments thereof encoded by the nucleic acid molecule according to the present 
invention. r 



invention. 



The present invention also provides a host cell comprising the vector according to the present 

According to another aspect the present invention further provides a hyperimmune serum-reactive 
antigen comprising an amino acid sequence being encoded by a nucleic acid molecule according to the 
present invention. & 

ot SeqS No l^SSST ** ^ b S6leCted * e «»«»■** 

In another preferred embodiment the amino acid sequence (polypeptide) is selected from the group 

f^Tt q ° 146 " 15 °' 152 ' 154_160 ' 162 ' 167 ' 169 - 175 ' 178 ' 180 ' 182 -!86, 188, 191-192, 195 197 
199-206,208,211,215-220,222-223,225-238,240-244. 

^ i f ^ er f ? refe ^ e x T em ?° diment 31111110 3dd Sf ** Uence (Polypeptide) is selected from the group 
consisting of Seq ID No 153, 161, 168, 176, 181, 187, 196, 198, 209-210, 214, 224. P 

According to a further aspect the present invention provides fragments of hyperimmune serum-reactive 

Cff^Jf 8MUp consistin S of P e P tid «* comprising amino acid sequences of column 

predicted immunogenic aa" and "location of identified immunogenic region" of Table 1; the serum 

55 a ?K ™ 6 2/ e8 P etSM 7 P e P tide * comprising amino acids 4-11, 35-64, 66-76, 101-108, 111- 
^ 114 ° f ^ m N ° 145; 5J17 ' 92 - 102 ' 107 " 113 ' H9-125, 133-139, 148-162, 177-187 195-201 

207-214, 241-251, 254-269, 285-300, 302-309, 317-324, 332-357, 365-404, 411-425, 443^ 4^477 479 487 

? 5 " 52 °' 532_547 ' 556 ' 596 ' 603 - 610 ' 616 - 622 ' 636-642 646-665 667^' Z^2 foM20 

162-167, 177-196, 45-109 and 145-172 of Seq ID No 147; 17-35, 64-75, 81-92, 100-119, 125-172, 174-183, 214^ 
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222, 230-236, 273-282, 287-303, 310-315, 331-340, 392-398, 412-420, 480-505, 515-523, 525-546 553-575 592 
™' Slo 9 ' 617 ' 625 ' 631 ^ 9 ' 644 " 651 ' **™> 681 " 687 ' 709-716 731-736 73^ tSS 5£ 

w^ 9 J'n 799 " 805 ' 809 ' 822 ' 859 - 870 ' 88 °- 885 ' 907 - 916 ' 924 " 941 ' 943 " 949 ' 973-986, 1010-1016, loSlSl 
1045-1054, 1057-1062, 1082-1088, 1095-1102, 1109-1120, 1127-1134, 1140-1146, 1152-1159 1169 117? 1187 

Zt^t^St 1273 ' 1279 " 1292 ' 1306 - 1312 ' 1332 ' 1343 ' 134 ^ 1364 ' ^J^^l^e] 
Ji ° ' 1524 - 1549 ' 1574-1588, 1614-1619, 1672-1685, 1697-1707, 1711-1720, 1738^1753 1781 

S" £ 38l"39 7 4 £5 r 227 ' 246 - 254 ' 279 287 ' 293 - 3 ° 8 ' ™-™> 330 ^ ^ 

Itl'T,' ' ' 41 ^ 422 ' 432 " 439 ' 450 - 460 ' 464 " 474 ' 482 -494, 501-508, 521-529, 536-546 553-558 

^t^iS^IS^ 3 ^ 653 - 660 - 673 - 681 - 688 - 698 - ^ 720 - 726 73 ^ 55 

do If ' ^ 549-647 ° f Seq 10 No 149; 9 - 30 ' 99 " 123 ' 170-178 and 1-128 of Seq ID No 

150; 7-32, 34^1, 96-106, 127-136, 154-163, 188-199, 207-238, 272-279, 306-312 318-325 341-^7l 5 3-360 

TIT'^TL*^' 452 - 5 ° 3 ' 575 - 58 °' 589 ' 601 ' 615 " 620 ' ^ 654-660, SSS, JSJS 

Sec fm %o 152 9 24 °? 0 eq ^ N ^S ; tSo 35 - 44 ' 48 " 59 ' ^ 93 " 99 ' 13 °- 138 ' 146 - i6 l - d 78 ^ 

Seq ID No 152; 24-30, 36^3, 64-86, 93-99, 106-130, 132-145, 148-165, 171-177, 189-220 230-249 251 263 

293-300, 302-312, 323-329, 338-356, 369-379, 390-412 and 179-193 of Seq ID No 153 30*9 liof 7 SV £ 

^^o 5 ! 167 ' 169 " 179 ' 182 " 197 ' 200 - 206 ' 238 " 244 ' 267 " 272 - d 23 °- 2 « ££m££i ItS £ 

65, 77-86 and 2-68 of Seq ID No 155; 4-9, 26-35, 42-48, 53-61, 63-85, 90-101, 105-111 113-121 129 137 140- 

^'r'? 8 ' 1 "' 226 ' 228 - 237 ' 248 - 255 ' 259 " 285 ' »M08t 314-331, 337-343 3^-36^, 410^ 436^ atd 
110-144 of Seq ID No 156; 36^7, 55-63, 94-108, 129-134, 144-158, 173-187, 196-206 209 238 251 ^ 2^0 
285, 290-295, 300-306, 333-344, 346-354, 366-397, 404^10, 422-435 439-453 466^' IZS SIS' 
569, 571-585, 590-596, 607-618, 627-643, 690-696, 704-714, 720-728, J^^^ vS?^ ^? ^ 
480-507 of Seq ID No 157; 16-25, 36-70, 80-93, 100-106 and 78-130 of Sec ID No 158 l^' S 6 50^7 
65-71 79-85, 93-98, 113-128, 144-155, 166-178, 181-188, 201-207, 242-262, 2^ 28^9^ ^SSJtSS 

296-302, 326-353, 360-370, 378-384, 400-405, 409-418, 420-435, 442-460, 499-506 529-534 55^562 sits™ 
644-651, 677-684, 687-698, 736-743, 759-766, 778-784, 808-814 852-858 874-896! 9^925 SJS 95^2' 
003-1012, 1021-1027, 1030-1044, 1081-1087, 1101-1111, 1116-1124, 1148-1159, 188 1196 

1328 " 1335 ' 1367 " 1373 ' 1431 - 1437 ' ^-l 458 ' 1 479 -1503, BlilSSSJ^S' 
1561-1568, 1598-1605, 1617-1647, 1658-1665, 1670-1676, 1679-1689, 1698-1704 1707-17U i £5 173SM E2 
1764, 1-70, 154-189, 922-941, 1445-1462 and 1483-1496 of Seq ID No 160; 6-st 8lTl IO4Y13 12^V lot 
159, 164-174, 197-209, 215-224, 229-235, 256-269, 276-282, 307-313, 317-348 351-35^ '111 t»7 IS' IS" 
464, 485-490, 498-509, 547-555, 574-586, 602-619 and 452-530 of Seq ID No 161 Si £ ^'49 £ fit 

E^^^fJ ^ 2 S 271 ' 279 - 303 - 315 ' 365 - 382 - 395 ^ ^^^££S££S 

ID No 162; 9-24, 41-47, 49-54, 68-78, 108-114, 117-122, 132-140, 164-169, 179-186 193-199 206-213 244.2m 
267-274, 289-294, 309-314, 327-333, 209-249 and 286-336 of Seq ID No ^^^'^SVit^ 

» ^2?2 f 1 " 2 ° 7 ' 22 °- 227 ' 242 " 247 ' 262 - 268 ' 305 - 318 ' 32 °- 325 ofSeqIDNol'64 So S 

39 47-58, 63-73, 86-96, 98-108, 115-123, 137-143, 148-155, 160-176, 184-189, 194-204 135-240 254^259 272 
278 and 199-283 of Seq ID No 165; 4-26, 33-39, 47-53, 59-65, 76-83, 91-97 loTll2 S W 
174, 198-207, 242-268, 273-279, 292-315, 320-332, 345-354, 358^367, 377^ 40^10 £g 445-45l' 453~ 

tf^ 1 ' 518 ' 535 " 57 °' 573 " 589 ' 60 ^ 6W ™ d 202 "242 of Seq ID No 1^30 3S tSf 76 2" 

97-103, 105-112, 134-151, 161-183, 211-234, 253-268, 270-276, 278-2^4, 297-305 309-31 ^3^ 366^72 

S SS- £2? ^ T 470 - 475 - 480 - 490 - 497 - 501 - 513 - 524 -^ 552 -St SS5 £5 

592-600, 619-625, 636-644, 646-656 and 316-419 of Seq ID No 167; 4-17, 52-58 84-99 102-110 I14ll2n i2d_ 
135 143-158, 160-173, 177-196, 201-216, 223-250, 259^267, 269-275 and 167 oi Sfm N 1^8 ^ 6^6 57 67 

2 50 S^i^JSif"* 182 " 187 ' 203 - 209 ' 214 - 229 ' 246 " 280 » d i S 9 " ^o^Seq m No\69 J 

353 S 22 2 ^« 8 - 223 " 229 - 248 ' 261 - 264 - 276 - 281 - 288 - 303 - 308 - 313 - 324 - *™™> 

™J£ ™™ 514 " 523 ' 556 " 564 ' 6 ° 5 - 616 ' 62 °- 626 ' ^l' 636 ' 667 "683, 685-699, 710-719 

f' 751 " 756 ' 760_771 ' 779 ' 788 ' ^15-828, 855-867, 869-879, 897-902, 917-924 926-931 936-942 981 innn 
1006-1015, 1017-1028, 1030-1039, 1046-1054, 1060-1066, 1083^1092, 1099-111^ uSniolllfiZ^ 
1158, 1161-1171, 1174-1181, 1209-1230, 1236-1244, 1248-1254, V^^^^^^S^ 
1332-1354, 1359-1372, 1374-1380, 1384-1390, 1395-1408, 141M425, US^^tS^SS^St 
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Hie present invention also provides a process for producing a S. pneumoniae hyperimmune serum 
reactive antigen or a fragment thereof according to the present invention comprising expressing one or 
more of the nucleic acid molecules according to the present invention in a suitable expression system. 

Moreover, the present invention provides a process for producing a cell, which expresses a S pneumoniae 
hyperimmune serum reactive antigen or a fragment thereof according to the present invention 
comprising transforming or transfecting a suitable host cell with the vector according to the present 
invention. 6 v 

According to the present invention a pharmaceutical composition, especially a vaccine, comprising a 
hyperimmune serum-reactive antigen or a fragment thereof as defined in the present invention or a 
nucleic acid molecule as defined in the present invention is provided. 

In a preferred embodiment the pharmaceutical composition further comprises an immunostimulatory 
substance, preferably selected from the group comprising polycationic polymers, especially polycationic 
peptides, immunostimulatory deoxynudeotides (ODNs), peptides containing at least two LysLeuLys 
motifs especially klklllllklk, neuroactive compounds, especially human growth hormone, alumn, 
freund s complete or incomplete adjuvants or combinations thereof . 

In a more preferred embodiment the immunostimulatory substance is a combination of either a 
polycaooruc polymer and immunostimulatory deoxynudeotides or of a peptide containing at least two 
LysLeuLys motifs and immunostimulatory deoxynudeotides. 

polyargkimT 16 embodiment P°lycationic polymer is a polycationic peptide, espedally 

According to the present invention ihe use of a nudeic add molecule according to the present invention 
or a hyperimmune serum-reactive antigen or fragment thereof according to the present invention for the 
manufacture of a pharmaceutical preparation, especially for the manufacture of a vacdne against S 
pneumoniae infection, is provided. 6 

Also an antibody, or at least an effective part thereof, which binds at least to a sdective part of the 

XTd™^^" 163 ^ 6 or a fra8ment ^ eot according to p fesent is 

In a preferred embodiment the antibody is a monodonal antibody. 

In another preferred embodiment the effective part of the antibody comprises Fab fragments. 
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In a further preferred embodiment the antibody is a chimeric antibody. 
In a still preferred embodiment the antibody is a humanized antibody. 

pTienTm^eSon ^ ^° * hybrid ° ma CeU ,ine ' which P^es an antibody according to the 

Moreover, ttie present invention provids a method for producing an antibody according to the present 
invention, characterized by the following steps: 

o initiating an immune response in a non-human animal by administrating an hyperimmune 
serum-reactive antigen or a fragment thereof, as defined in the invention, to said animal 

• removing an antibody containing body fluid from said animal, and 

* producing the antibody by subjecting said antibody containing body fluid to further 
purmcation steps. 

Accordingly, the present invention also provides a method for producing an antibody according to the 
present invention, characterized by the following steps: S 

• initiating an immune response in a non-human animal by administrating an hyperimmune 
serum-reactive antigen or a fragment thereof, as defined in the present invention, to said animal, 

• removing the spleen or spleen cells from said animal, ' 

• producing hybridoma cells of said spleen or spleen cells 

• SgnS Aeretf 115 ^ Specific for said ^^une serum-reactive antigens or a 
' Prific2n*ep a s ntib0dy ^ CUWVati0n ° f ^ d ° ned h y bridoma ce ^ optionally further 

T^»™Z PI T d T OI Pr0dUCed aCC ° rdinS t0 ab ° Ve methods m ^ be used *e preparation of 
a medicament for treating or preventing S. pneumoniae infections. 

According to another aspect the present invention provides an antagonist, which binds to a 
hypenmmune serum-reactive antigen or a fragment thereof according to the present invention 

Such an antagonist capable of binding to a hyperimmune serum-reactive antigen or fragment thereof 
according to the present invention may be identified by a method comprising the foUowingSS- 

a) contacting an isolated or immobilized hyperimmune serum-reactive antigen or a fragment 

oe^t WnH 8 /° ? e P 7T inVenti ° n ^ 3 CBnd,date ^nist under condition to 
fr^L^ antaS ° niSt t0 S3id hyperimmune serum-reactive antigen or 

M^StoT^ ? 3 COmp ° nent Capable of P rovidin S « detectable signal in response to 
Aereo^and antagonist to said hyperimmune serum reactive antigen or fragment 

b> ^onL^T 6 "' 6 ° r 3bSenCe ° f 3 SlSnal generated 111 reS P onse to binding of the 
antagomst to the hypenmmune serum reactive antigen or the fragment thereof. 

£m~n S< f St " apaWe f 01 «* interaction activity of a hyperimmune serum-reactive 

2" ^ accordm g to *e P-sent invention to its Interaction partner mtf be 

identified by a method comprising the following steps- 7 

3) S2 SenUn leaCtiVe ° r 3 h "™' fra ^nt thereof according 

b> Zlt 15 an - i ? 1 teraCHOn Partner tC said hyperimmune serum reactive antigen or a fragment 
thereof, especially an antibody according to the present invention, a tragment 

c) allowmg interaction of said hyperimmune serum reactive antigen or fragment thereof to said 
interaction partner to form an interaction complex 



WO 2004/092209 



PCTVEP2004/003984 



-14- 

d) providing a candidate antagonist, 

e) allowing a competition reaction to occur between the candidate antagonist and the interaction 
complex, 

f) determining whether the candidate antagonist inhibits or reduces the interaction activities of the 
hyperimmune serum reactive antigen or the fragment thereof with the interaction partner. 

£^Tf^ Une 7T reaC ?7 e ° r tegnents thereof according to the present invention may 

be used for the isolation and/or purification and/or identification of an interaction partner of said 
hyperimmune serum reactive antigen or fragment thereof. 

The present invention also provides a process for in vitro diagnosing a disease related to expression of a 
hyperimmune serum-reactive antigen or a fragment thereof according to the present invention 

rr a X Sm8 « etermU t 8 ^ ° f 3 nUCl6iC 3dd Se< * UenCe Said hyperimmune senun" 

reactive antigen or fragment thereof according to the present invention or the presence of the 
hyperimmune serum reactive antigen or fragment thereof according to the present invention. 

The present invention also provides a process for in vitro diagnosis of a bacterial infection, especially a S 
pneumoniae infection, comprising analyzing for the presence of a nucleic acid sequence encoding said 

n^ZT^T m teaCtiVe 8ntigen M fraSment Aereo£ accordin S to Present invention or the 
motion hyperimmune serum reactive antigen or fragment thereof according to the present 

th?w *!! P'T"* tav « Hon P rovide * *e use of a hyperimmune serum reactive antigen or fragment 
thereof according to the present invention for the generation of a peptide binding to said hyperinfmune 
serum reactive antigen or fragment thereof, wherein the peptide is an anticaline. ^ 

The present invention also provides the use of a hyperimmune serum-reactive antigen or fragment 
tiiereof according to the present invention for the manufacture of a functional nucleic acid, wherein the 
functional nucleic acid is selected from the group comprising aptamers and spiegelmers. 

The nucleic acid molecule according to the present invention may also be used for the manufacture of a 
functional ribonucleic acid, wherein the functional ribonucleic acid is selected from the group comprising 
nbozymes, antisense nucleic acids and siRNA. p s 

^hS^h ™?*? adv fu SeOUSly prOVideS m efficta * relevant comprehensive set of isolated 
rt^lTt < h yP erimm "ne serum reactive antigens or fragments thereof 

identified from S. pmunumm using an antibody preparation from multiple human plasma pools and 

a^dl ST 0 " ? deriVed ^ * e Sen ° me ° f * Vneumoniae. Thus, the present invention fulfils 
io?™L *t ^ J° X S-rTT™ antigenS ' VaCdneS ' dia 8™^ and products useful in procedures 
for preparing antibodies and for identifying compounds effective against S. pneumoniae infection. 

a^d e lT i rbir^% Sh °^t b t C T POSed ° f Pr ° teins ° r Peptides, which are expressed by all strains 
and are able to induce high affinity, abundant antibodies against cell surface components of S 

Sh ° Uld ^ I§G1 «*■ ^ f ° r opsoSon, and any IgG sub^Tand g A 
coZZtT ^\ u^"^ t0ydn ^ A dhmSe ^ d6fined Vaccine mus * be defiruSy superior 
re^fwitn huml t VaC ^J att ^ ted ° r ^ S ™ e «™P°nents of S. pneumoniae, wluchLss- 
ProTec^vt zZZ V* ***** °P SOnization can be eliminated, and the individual proteins inducing 
protective antibodies and/or a protective immune response can be selected. S 

The approach, which has been employed for the present invention, is based on the interaction of 
pneumococcal proteins or peptides with the antibodies present in human sera. The antibodt ^ prtduced 
agamst S. pneumoniae by the human immune system and present in human sera are mdicJve oi Sfc 
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vivo expression of the antigenic proteins and their immunogenicity. In addition, the antigenic proteins as 
identified by the bacterial surface display expression libraries using pools of pre-selected sera, are 
processed in a second and third round of screening by individual selected or generated sera. Thus the 
present invention supplies an efficient, relevant, comprehensive set of pneumoococcal antigens as a 
pharmaceutical composition, especially a vaccine preventing infection by S. pneumoniae. 

In the antigen identification program for identifying a comprehensive set of antigens according to the 
present invention, at least two different bacterial surface expression libraries are screened with several 
serum pools or plasma fractions or other pooled antibody containing body fluids (antibody pools) The 
antibody pools are derived from a serum collection, which has been tested against antigenic compounds 
of S. pneumoniae, such as whole cell extracts and culture supernatant proteins. Preferably, two distinct 
serum collections are used: 1. With very stable antibody repertoire: normal adults, clinically healthy 
people, who are non-carriers and overcame previous encounters or currently carriers of S. pneumoniae 
without acute disease and symptoms, 2. With antibodies induced acutely by the presence of the 
pathogenic organism: patients with acute disease with different manifestations (e.g. S. pneumoniae 
pharyngitis, pneumonia, bacteraemia, peritonitis, meningitis and sepsis). Sera have to react with multiple 
Pneumococcus-specific antigens in order to be considered hyperimmune and therefore relevant in the 
screening method applied for the present invention. 

The expression libraries as used in the present invention should allow expression of all potential antigens 
e.g. derived from all secreted and surface proteins of S. pneumoniae. Bacterial surface display libraries will 
be represented by a recombinant library of a bacterial host displaying a (total) set of expressed peptide 
sequences of S. pneumoniae on two selected outer membrane proteins (LamB and FhuA) at the bacterial 
host membrane {Georgiou, G., 1997}; {Etz, H. et al., 2001). One of the advantages of using recombinant 
expression libraries is that the identified hyperimmune serum-reactive antigens may be instantly 
produced by expression of the coding sequences of the screened and selected clones expressing the 
neTess^ 11116 Semm " reaCtive ^S 6 ™ without further recombinant DNA technology or cloning steps 

The comprehensive set of antigens identified by the described program according to the present 
mvention is analysed further by one or more additional rounds of screening. Therefore individual 
antibody preparations or antibodies generated against selected peptides, which were identified as 
immunogenic are used. According to a preferred embodiment the individual antibody preparations for 
the second round of screening are derived from patients who have suffered from an acute infection with 
S. pneumoniae, especially from patients who show an antibody titer above a certain minimum level, for 
example an antibody titer being higher than 80 percentile, preferably higher than 90 percentile, especially 
higher than 95 percentile of the human (patient or healthy individual) sera tested. Using such high titer 
individual antibody preparations in the second screening round allows a very selective identification of 
the hyperimmune serum-reactive antigens and fragments thereof from S. pneumoniae. 

Following the comprehensive screening procedure, the selected antigenic proteins, expressed as 
recombinant proteins or in vitro translated products, in case it can not be expressed in prokaryotic 
expression systems, or the identified antigenic peptides (produced synthetically) are tested L a second 
screening by a series of ELBA and Western blotting assays for the assessment of their immunogenicity 
with a large human serum collection (minimum -150 healthy and patients sera). 

It is important tfiat the individual antibody preparations (which may also be the selected serum) allow a 
tZ Tlu 1 f t ent,&Catl0n ° f me moSt P romisi ng candidates of all the hyperimmune serum-reactive antigens 
from all the promismg candidates from the first round. Therefore, preferably at least 10 individual 

2 S* ^ ° dy P** 313 * 0118 < e -S' sera > from at least 10 different individuals having 
second T * Ch ° Sen Path ° gen) should be used m identifying these antigens in thl 

second screerung round. Of course, it is possible to use also less than 10 individual preparations, 



WO 2004/092209 



PCT/EP2004/003984 



-16- 

however, selectivity of the step may not be optimal with a low number of individual antibody 
preparations. On the other hand, if a given hyperimmune serum-reactive antigen (or an antigenic 
fragment thereof) is recognized by at least 10 individual antibody preparations, preferably at least 30, 
especially at least 50 individual antibody preparations, identification of the hyperimmune serum-reactive 
antigen is also selective enough for a proper identification. Hyperimmune serum-reactivity may of course 
be tested with as many individual preparations as possible (e.g. with more than 100 or even with more 
than 1,000). 

Therefore, the relevant portion of the hyperimmune serum-reactive antibody preparations according to 
the method of the present invention should preferably be at least 10, more preferred at least 30, especially 
at least 50 individual antibody preparations. Alternatively (or in combination) hyperimmune serum- 
reactive antigens may preferably be also identified with at least 20%, preferably at least 30%, especially at 
least 40% of all individual antibody preparations used in the second screening round. 

According to a preferred embodiment of the present invention, the sera from which the individual 
antibody preparations for the second round of screening are prepared (or which are used as antibody 
preparations), are selected by their titer against S. pneumoniae (e.g. against a preparation of this pathogen, 
such as a lysate, cell wall components and recombinant proteins). Preferably, some are selected with a 
total IgA titer above 2,000 U, especially above 4,000 U, and/or an IgG titer above 5,000 U, especially above 
12,000 U (U = units, calculated from the OD^™ reading at a given dilution) when the whole organism 
(total lysate or whole cells) is used as antigen in the ELISA. 

The antibodies produced against streptococci by the human immune system and present in human sera 
are indicative of the in vivo expression of the antigenic proteins and their immunogenicity. The 
recognition of linear epitopes recognized by serum antibodies can be based on sequences as short as 4-5 
amino acids. Of course it does not necessarily mean that these short peptides are capable of inducing the 
given antibody in vivo. For that reason the defined epitopes, polypeptides and proteins are further to be 
tested in animals (mainly in mice) for their capacity to induce antibodies against the selected proteins in 
vivo. 

The preferred antigens are located on the cell surface or are secreted, and are therefore accessible 
extracellularly. Antibodies against cell wall proteins are expected to serve multiple purposes: to inhibit 
adhesion, to interfere with nutrient acquisition, to inhibit immune evasion nand to promote phagocytosis 
{Hornef, M. et al., 2002}. Antibodies against secreted proteins are beneficial in neutralisation of their 
function as toxin or virulence component. It is also known that bacteria communicate with each other 
through secreted proteins. Neutralizing antibodies against these proteins will interrupt growth- 
promoting cross-talk between or within streptococcal species. Bioinformatic analyses (signal sequences, 
cell wall localisation signals, transmembrane domains) proved to be very useful in assessing cell surface 
localisation or secretion. The experimental approach includes the isolation of antibodies with the 
corresponding epitopes and proteins from human serum, and the generation of immune sera in mice 
against (polypeptides selected by the bacterial surface display screens. These sera are then used in a 
third round of screening as reagents in the following assays: cell surface staining of S. pneumoniae grown 
under different conditions (FACS or microscopy), determination of neutralizing capacity (toxin, 
adherence), and promotion of opsonization and phagocytosis (in vitro phagocytosis assay). 

For that purpose, bacterial E. coli clones are directly injected into mice and immune sera are taken and 
tested in the relevant in vitro assay for functional opsonic or neutralizing antibodies. Alternatively, 
specific antibodies may be purified from human or mouse sera using peptides or proteins as substrate. 

Host defence against S. pneumoniae relies mainly on opsonophagocytic killing mechanism. Inducing high 
affinity antibodies of the opsonic and neutralizing type by vaccination helps the innate immune system to 
eliminate bacteria and toxins. This makes the method according to the present invention an optimal tool 
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for the identification of pneumococcal antigenic proteins. 

The skin and mucous membranes are formidable barriers against invasion by streptococci However 
once the skin or the mucous membranes are breached the first line of non-adaptive cellular defence 
begins its coordinate action through complement and phagocytes, especially the polymorphonuclear 
leukocytes (PMNs). These cells can be regarded as the cornerstones in eliminating invaing bacteria. As 
Streptococcic pneumoniae is a primarily extracellular pathogen, the major anti-streptococcal adaptive 
response comes from the humoral arm of the immune system, and is mediated through three major 
mechanisms: promotion of opsonization, toxin neutralisation, and inhibition of adherence. It is believed 
that opsonization is especially important, because of its requirement for an effective phagocytosis. For 
effiaent opson^tion the microbial surface has to be coated with antibodies and complement factors for 
recognition by PMNs through receptors to the Fc fragment of the IgG molecule or to activated C3b After 
opsonization, streptococci are phagocytosed and killed. Antibodies bound to specific antigens on the cell 
surface of bacteria serve as ligands for the attachment to PMNs and to promote phagocytosis. The very 
same antibodies bound to the adhesins and other cell surface proteins are expected to neutralize adhesion 
and prevent colonization. The selection of antigens as provided by the present invention is thus well 
suited to identify those that will lead to protection against infection in an animal model or in humans. 

According to the antigen identification method used herein, the present invention can surprisingly 

£Z » a * ?Tc ^ n ° VeI nUdGiC 3CidS Bnd n ° vel hyperimmune serum reactive antigens and 
fragments thereof of S. pneumoniae, among other things, as described below. According to one aspect the 
invention particularly relates to the nucleotide sequences encoding hyperimmune serum reactive 
antigens which sequences are set forth in the Sequence listing Seq ID No: 1-144, 289-303 and the 
corresponding encoded amino acid sequences representing hyperimmune serum reactive antigens are set 
forth in the Sequence Listing Seq ID No 145-288 and 304-318. 

™ emb ° dimei ; t 01 *" P resent mven *on, a ™*efc acid molecule is provided which exhibits 

70 /o identity over their entire length to a nucleotide sequence set forth with Seq ID No 1, 101-144 Most 
highly preferred are nucleic acids that comprise a region that is at least 80% or at least 85% identical over 
their entire length to a nucleic acid molecule set forth with Seq ID No 1, 101-144. In this regard nucleic 
acid molecules at least 90%, 91%, 92% 93%, 94%, 95% or 96% identical over their entire lengm to me 

Ea?98o/ ? " it? ly ?£r e± FUrth f rm ° re ' th ° Se *** at least 97% ™ preferred, thofTwith at 

least 98 /o and hat least 99% are particularly highly preferred, with at least 99% or 99.5% being the more 
preferred, with 100% identity being especially preferred. Moreover, preferred embodiment Z 
Z^IT H- f S hyperimmune serum reactive ^tigens or fragments thereo 

^ESSS^h SUbStantiaUy ** S3me bi ° 1 °S ical or activit y ^ the mature 

polypeptide encoded by said nucleic acids set forth in the Seq ID No 1, 101-144. 

38 ^ * ^ ^ ^ hCrein ' * relation *iP between two or more polypeptide 

^™° r a to OT mOTe r'T^t 0 SeqU6nCeS ' 38 defermined ^ COm P--8 the sequenced the 
art, identity also means the degree of sequence relatedness between polypeptide orlolynucleotide 
sequences, as the case may be, as determined by the match between strings of such sequent WeX 
can be readdy calculated. WhUe there exist a number of methods to measure identity berweTtw^ 

Aril t mT *r\ P( : lyPeptide SeqUenC6S ' tenn b Wdl teown to skiUed -tisans l^ueZce 
fdeS M ; le f Ular * Wl °W' Von Hei ^' G., Academic Press, 1987). Preferred methods to^etermine 

2 JTZ ^ larg6St ^ b6tWeen me se <* uences tested. Methods to determine 
bet^n^o « m COI 7T J* 10 *?*™- Preferred ^P"ter program methods to determine identity 

Sffi^tir^ prosram package id — j et *• i9mX 

iTStlTth^ 7"* °1 inVenti ° n ' nUd6iC add m ° IeCuleS 316 P rovided whi * exhibit at least 
96 /„ identity to the nucleic aad sequence set forth with Seq ID No 2-6, 8, 10-16, 18-23, 25-31, 34, 36, 38^2, 
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44, 47-48, 51, 53, 55-62, 64, 67, 71-76, 78-79, 81-94, 96-100. 

1 3 ^ ° f ^ PieSent inVenti ° n ' nudeic acid molecules « Provided which are 

identxcal to the nucleic aad sequences set forth with Seq ID No 9, 17, 24, 32, 37, 43, 52^66,^0 

The nucleic acid molecules according to the present invention can as a second alternative also be a nucleic 
aad molecule which is at least essentially complementary to the nucleic ada^SS ^ as thTS 
alternatave above. As used herein complementary means that a nucleic acid sIxanT^Le pairt. vTa 
Watson-Cnck base pairing with a second nucleic acid strand. Essentially co^np^^^^ 

"iVo7/:™i: tzt* for au :r - bases of 

au. i.umper or percentage of the bases unpaired or wrongly paired. The percenter n f 
pairmg bases is preferably at least 70 %, more preferably 80 %, even moR ^refe^Z 9?oI aS ^2 
preferably any percentage higher than 90 %. It is to be noted that a percentage ^^l^A^t^l 

' DNA a^" Perf0nned 6 ' S - ta 5 X 5 X <"% SDS, 100 g/mL sheared 

• Moderate stringency wash in 0.2xSSC, 0.1% SDS at 42°C 

• High stringency wash in O.lxSSC, 0.1% SDS at 68°C 

a^^^' ° f 50% ha! ^ T " ° f 1% — * *a T„ is 

to addition, any of *a tahar hybridization conditions described hasain ara in psincipla appiicabla as 

the dagan^o^^rco^^rS^ "'IT" 5 °, 2 f - ^ «~ 

a(DNA)chip),etc) Preferred T § ' T ^ } PrUnerS/ P robes - ca P^e molecules (e.g. on 
or more of 2 ^^^^ * 3 C ° ntiSU ° US 15 base P ortion <* one 



WO 2004/092209 

PCT/EP2004/003984 

and/or any other published S im«»JL T Uiettelin ' et ah ' 2001 }; GenBank accession AE005672) 
((Hoskir/j. et al" SfSiS ac ™E007^ e ? T "T* ° f *" 

residues are residues, which lead to Tnon tEST" preferred ^on-identical nucleic acid 

sequences encode for polypeptide °iZ at^eS 7™ St^ 6 - *» nudeic acid 

different amino add residue! compared TV Tu \ f * leaSt 2 ' P ref ^ly at least 3 

Also such isolated po Wptide^ ^tJ^fVt^ V™™™ ^terparts mentioned above. 
e.g. in the sequence H^na^f it^ "? * f" ^ < W *" whole protein) mentioned herein 
nucleic acids are preferred ' ' 8 amm ° aad residues ^ bei *S encoded by these 

Srm^wS^^ «? a *»* — also be a nucleic 

present invention according to foHbTe ou^T T * T * ** of * e ™**<= of the 
hybridisationconditionsa^ S6COnd ' - d ** d Amative. Stringent 

nuclei ^^^TuT^Z^ T? , ~ ^ - 3 - *e a 

nudeic add molecules^ordfog *££SEZ t* ^fV***' WOuld ^ridise to any of the 
first, second, third, and faJ3^"S^S£?££ tTZT*!?*** 4 ° 
the fact that preferably the nudeic adds a^rdfaTl * ° f nUd-C add molecule re£ers to 

serum reactive antieens «. fra " ,13 ° me P resent mention code for the 

hyperimmune 

molecule is particularly usefuT^ deTe^on ^Tf preS ff tovertton - ™ s *nd of nudeic add 
invention and thus the LenTis of foe rZ^T ^ m ° leCUle accordin S to present 

or diseased condition W o XoT ™* a * S. P „ eM «e and any disease 

occur or be preformed ^e^^SZ^T " ^-tion would 

described above. 8 conditions as descnbed m connection with foe fourth alternative 

dt^bo^cle" -y ribonudeic acid molecule or 

Thus, for instance, nucleic adH ^T?JZ ™ dlfted . A or DNA or modified RNA or DNA. 
sanded DNA DNA that is a nuxmrl ^l t ^ ^ff referS to ' amon S other - single-and double- 
single- and dtbleJand^^^ ^ ** d that is a mixture of 
stranded or, more typically, ZZT^^^^T ^ ^ ^ 
stranded regions. In addition, nuddc add mo^Z Stranded ' ° r a nuxture of s *gle- and double- 
comprising RNA or DNA or bofo RNA arfd DNA^hJ " t0 ^P^-ded regions 
molecule or from different moleculer^?r;«S! ^ " SUCh regi0ns ma ^ be ^ the same 
more typically involve ordyTrSfon oTso™ TT^ T , * * ^ ° r ^ ° f * e molecules ' ^ut 
region often i an oBgo^Tl ^h^r^ °T ° f *» m ° leCules of * triple-helical 
RNAs as described ab^ fo at comainor^nr ^ ^ ""^ add molecule DNAs or 
modified for s^^^^^^^^J^ ^ ° NAs ° r with backbones 
Moreover, DNAs oT RNAs oZZT^LT£L T " ** tWn * inteuted herei - 
tritylated bases, to name just two JSL^SL^T^ " , *' " m0dified bases ' such - 
appreciated that a great vari^ oT^ 

useful purposes kno^n to fooseVsl^^ T * ° NA 3nd ^ ^ -any 

embraces such chemicaUy, enzymati^lly ?S2^^^'^- ttb -^^ 
well as foe chemical forms of DNA and 1 RNA Xa^T^ ^ rmS ° f nUcleic acid m °lecule, as 
complex cells, inter alia. The term nucldc aTd ^ST* k^ 68 ^ ^ s-ple and 

referred to as oligonudeotide(s). i^SSSdSS^ ^ *"* ""^ add m ° leCuleS often 
used interchangeably herein. 1 ° lynUCleotlde ^ n «clexc aad" or "nudeic add molecule" are often 

Nudeic add molecules provided in foe present invention also encompass numerous unique fragments, 
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boA longer and shorter than the nucleic acid molecule sequences set forth in the sequencing listing of the 
^—cockg regions, which can be generated by standard cloning methoas. ToSla 
fragment must be of sufficient size to distinguish it from other known nucleic acid s^ZZ Zst 

selected s — " fragment to - — de%™ 

Additionally, modifications can be made to the nucleic acid molecules and polypeptides that are 

XT?? 7 * H Pr6Sen i ^r ti ° n - F ° r nUCleotide substitutions can be maS wWc^o not 

f^tt P ° lypepfade * * e ^ ^ic add, and thus any nucleic acid molecule wlul encodes a 

hypenmmune serum reactive antigen or fragments thereof is encompassed by the present invltTon 

f^^° r J ^ ^/"fl* acid molecu ^ coding hyperimmune serum reactive antigens or 
fragments thereof provided by the pre sent invention can be functionally linked u^SaS 
techniques such as standard cloning techniques, to any desired regulatory sequence ^wheu^ a S 

EZT r6gU I* 017 SCqUenCe ° r 3 heterol °S°- -Sulatory sequent iZ^^J^Z^ 
heterologous marker sequence or a heterologous coding sequence to create a fusion protdn 

Nucleic acid molecules of the present invention may be in the form of RNA, such as mRNA or cRNA or 
^hZ^^T*™ further , relates to varian * of the herein above described nucleic acid molecules 

bequence Listing, in which several a W/ r^ihi^rix o o ^ M ^ &et Iortn m me 

P^Sr y d one" r E *° *• present invention also include modified epitopes wherein 

rulef oldosed intg (ToSdTT J? 2000^ *f** T 01 a « * *e 

modified epitope^. ' ^ ^ 38 We " 38 *" nudeic acid encoding such 

™™^ s ^^?:r s ^ c i? om ^- pies r epitopes by -*» add ^0^ 

covered by LeXp^^XlTZ f 6 * ^f 1 ™** ca P a ™* <* epitopes are 
epitopes, which do not conZt? ?£1 T "T ^ pr6Sent ^P 68 3180 cov <* 
or pref^ably an ta^^SToS "ST* > T * P ~" fae ' but the same 

y proved T cell response. These epitope are referred to as "heteroditic"; they need to 



WO 2004/092209 



PCT/EP2004/003984 



^T^r 131 Z ^^J. t0 MH ^ LA moIecules < md * e ™* d the ability to stimulate 

the T cell receptors (TCR) directed to the original epitope in a similar or preferably stronger manner. 

Heteroclitic epitopes can be obtained by rational design Le. taking into account the contribution of 
mdmdual residues to bmdmg to MHC/HLA as for instance described by {Rammensee, H. et al., 1999} 
combined with a systematic exchange of residues potentially interacting with the TCR and testui the 
resulhng sequences with T cells directed against the original epitope. luch a desigf is'osStefofa 
skilled man in the art without much experimentation. po^iwe a 

pother possibility includes tfie screening of peptide libraries with T cells directed against the original 

be^tn H P ° Siti0nal SCannmg ° f P e P tide Suiapproaches W 

been described in detail for mstance by {Hemmer, B. et al., 1999}and the references given therein. 

As an alternative to epitopes represented by the present derived amino acid sequences or heteroclitic 
beappSd mimiddng **" 6g - "P^emimetica- or "retSmverso-pepuS- ^ 

Another aspect of the design of improved epitopes is their formulation or modification with substances 
mcreasing their capacity to stimulate T cells. These include T helper cell epitopes, lipids 2 HpoVom^ or 
preferred modifications as described in WO 01/78767. liposomes or 

Another way to increase the T cell stimulating capacity of epitopes is their formulation with immune 
stunulaung substances for instance cytokines or chemokines like mterleukin-2, -7, -12 -IsXsI^nd n 
interferons (IFN), especially IFN-gamma, GM-CSF, TNF-alpha, flt3-ligand and others 

As discussed additionally herein regarding nucleic acid molecule assays of the invention for instance 

SST cdSITiS 601168 of ^r vention 38 discussed above ' ™y be - ■ hybrid P ~bTS 

of £ ? 8en0nUC DNA *° iS ° late full " len S th cDNA * *™» genomic clones encoding polypeptides 

of the presen invention and to isolate cDNA and genomic clones of other genes maf LvTa Wh 
sequence similarity to the nucleic acid molecules of the present invention. Such probes senerX wiU 
comprise at least 15 bases. Preferably, such probes will have at least 20, at least 25 or^st^b^ Ind 
may have at least 50 bases. Particularly preferred probes will have at least 30 bases, and wiU have 50 
bases or less, such as 30, 35, 40, 45, or 50 bases. 

*Zl : Xample ' ?* C °1i S regi ° n ° f a nUCl6iC add molecule of *» P resent invention may be isolated bv 
27,' rele ^ant library using the known DNA sequence to synthesize an oligonucleotide probe I 
abeled oligonucleotide having a sequence complementary to that of a gene of the present invention Z 

tSX ™ridt7 ° f ^ NA ' g6n0miC ° NA ° r ^ t0 d ~ to ^ niemb~e S 

The nucleic acid molecules and polypeptides of the present invention may be employed as reagents and 
materials for development of treatments of and diagnostics for disease, particularly hum^ ftease at 
further discussed herein relating to nucleic acid molecule assays, inter aUa. 

The nucleic add molecules of the present invention that are oligonucleotides can be used in the processes 
h ' J? f ° r PCR - *° determine - hether or ~t the S. P irZ 

ec^c 1 ti^T ° r ^ Part Pr6Sent ^ tamBcritad ^ **** **ue such as blood 8 ft! 

tha * SUCh S ^ uences wiU also have utility in diagnosis of the stage of infection and tvoe of 

LtX to™ 6 "' f ° r 0th6r PUrP ° Ss ^ -TOt^neTthl 

° aaas accordln S to the present mvention as described herein, may be used. 
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Hie nucleic add molecules according to the presenftavenUon may be used for the detection of nuclei, 
acrd mo leches and organisms or samples containing these nudeic adds. Preferably «3^S^S 

sezt" te - **— ° f a *— - * *■ p~ rzL f 5 

Eukaryotes (herein also "individual^), particularly mammals, and espedallv humans Infects »Hth c 

~ fer rCs^xr^r^^ est* - — 

„ . X r , quantitation of nucleic acid molecules, such as, for examde PCT? rt Pro 

Bnase protectton. Northern blotting other hybridisatton methods and the arrays S. h2n f 

nattu-aUy^urru^nuddcaSmole^e^auTvotH^ "IT"™" 1 - » »<*• F » «™P"* a 
natural state is nol "isolated " taTiT^ P°>ypeP"de naturally present in a living organism in its 

costing JL5£ 2£T£JS? stte"£- a l r'r 16 ot po,3 r ^ * & ° m 

foUowingtolatton, such nuddctdTmoL,'^ bel fa^th """^ ^ * ^ ° f ° r 

be in their naturally occurrW 1 as the term is used herein, because they would not 

enzymatic reacdons, for iSST W M ^^7^1 ' C ° mpOSl,to " s « *» *emlcal or 

method. In preferreSS^nS t^l™ „ l" 81 "* *" ^ dKCribed ""^ «i<=«i"Cation 
an amino .« «^TXT^wS£!!!£ ^™ «™™-*. anttgen comprising 
fragments thenTare provided fa ZT'^f aadS m0lecu,es hereta ™d 

hyperimmur* serum J^»tt!^^ «nbod,ment of the invention a novel set of 

consisting of the polypeptide J^SL le^t No l^^t^rZaT "» 

are provided. In a further preferred emhnH.JL,* *u • , ' 88 31141 fragments thereof 

which comprise -iS^Sd^STSSS^ h 5? erim — serum-reactive antigens 

NO 253. 2«, 168, 2 7 , 2S1, 2 87 , £ ^"^S^ZS^^T " ^ " 
nm hyperimmune serum teacttve antigens tmd fragments thereof as padded in dte inventton indude 
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any polypeptide set forth in the Sequence Listing as well as polypeptides which have at least 70% identity 
to a polypeptide set forth in the Sequence Listing, preferably at least 80% or 85% identity to a polypeptide 
set forth in the Sequence Listing, and more preferably at least 90% similarity (more preferably at least 
90% identity) to a polypeptide set forth in the Sequence Listing and still more preferably at least 95% 
96%, 97%, 98%, 99% or 99.5% similarity (still more preferably at least 95%, 96%, 97%, 98%, 99% or 99 5% 
identity) to a polypeptide set forth in the Sequence Listing and also include portions of such polypeptides 
with such portion of the polypeptide generally containing at least 4 amino acids and more preferably at 

5f?J '/ m ° re P referabi y at least 30 - stm more preferably at least 50 amino acids, such as 4, 8, 10, 20, 
30, 35, 40, 45 or 50 amino acids. 

The invention also relates to fragments, analogs, and derivatives of these hyperimmune serum reactive 
antigens and fragments thereof. The terms "fragment", "derivative" and "analog" when referring to an 
antigen whose amino acid sequence is set forth in the Sequence Listing means a polypeptide which 
retains essentially the same or a similar biological function or activity as such hyperimmune serum 
reactive antigen and fragment thereof . 

The fragment, derivative or analog of a hyperimmune serum reactive antigen and fragment thereof may 
be 1) one in which one or more of the amino acid residues are substituted with a conserved or non- 
conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino 
acid residue may or may not be one encoded by the genetic code, or 2) one in which one or more of the 
ammo acid residues includes a substituent group, or 3) one in which the mature hyperimmune serum 
reacbve antigen or fragment thereof is fused with another compound, such as a compound to increase the 
half-life of the hyperimmune serum reactive antigen and fragment thereof (for example, polyethylene 
glycol), or 4) one in which the additional amino acids are fused to the mature hyperimmune serum 
reactive antigen or fragment thereof, such as a leader or secretory sequence or a sequence which is 
employed for purification of the mature hyperimmune serum reactive antigen or fragment thereof or a 
proprotein sequence. Such fragments, derivatives and analogs are deemed to be within the scope of 
those skilled in the art from the teachings herein. 

The present invention also relates to antigens of different S. pneumoniae isolates. Such homologues may 
easily be isolated based on the nucleic acid and amino acid sequences disclosed herein. There are more 
than 90 serotypes in more than 40 serogroups distinguished to date and the typing is based on serotype 
specific antisera. The presence of any antigen can accordingly be determined for every serotype In 
addition it is possible to determine the variability of a particular antigen in the various serotypes as 
described for the S. pyogenes sic gene {Hoe, N. et al., 2001). The contribution of the various serotypes to 

^f^T nK T C>l i nfeCti ° i nS VariGS fa different age ^"P 8 and graphical ^ons {Gray, 
; S '• i { 7 ' 6t 1986}; { ° ranSe ' M - et d - 19931 ' reviewed in Epidemiology and Prevention 
of Vaccme-Preventable Diseases, 7th Edition-Second Printing, The Pink Book). It is an important aspect 
that the most valuable protective antigens are expected to be conserved among various clinical strains 

Among the particularly preferred embodiments of the invention in this regard are the hyperimmune 
serum reactive antigens set forth in the Sequence Listing variants, analogs, derivatives and fragments 
thereof, and variants, analogs and derivatives of fragments. Additionally, fusion polypeptides 
comprising such hyperimmune serum reactive antigens, variants, analogs, derivatives and fragments 
thereof, and vanante, analogs and derivatives of the fragments are also encompassed by the present 
mvention. Such fusion polypeptides and proteins, as well as nucleic acid molecules encoding them, can 
reaculy be made using standard techniques, including standard recombinant techniques for producing 
and expression of a recombinant polynucleic acid encoding a fusion protein. 

^r S K «lT d m 01086 ** Var y bom a reference b y conservative amino acid substitutions. 

ukfcn^ 2" "? SUbStitUte 3 ^ amin ° add m 3 P^Ptide by another amino acid of 

like characteristics. Typically seen as conservative substitutions are the replacements, one for another 
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residues are substituted, deleted or aHH^H 77 & . 10 ' 1 to 5 ' 1 to 3 ' 2 ' 1 or no amino acid 

silent substitutions, additio^d ?S ^ °^ ****** prefOTed ™ 

polypeptide of the p^^SST^^^^ ~ * ^ 

S^e^tin^^ ~ a 3SL 

variantsandderivativesof me polypeptides tte^cel^ ^ ^ * 

£t™tX^ 

fragment mereof^^n^^tive 1°T "T^ hyP6r ™ 6 *™ -active antigenLd 
standing", U., not h 8 "^ ^ SUCh ^ be 

within a larger polypeptide of wWc^ ^1 P^P^es, or *«* may be comprised 

invention are tSJSSt^^^^l f ^ ^ P referred * ** «P«* of the 
present mvention^e. fn^^^JZ^Tt I ° f *" ******** of the 

and beta-sheet fo^g reS tu^Ld^f^ ^ l 1 ^ 1 * fanata » bet *- sh «* 
regions, hydrophobic ^Sr'Sa am^aZ" 8 • egl ° 1 l!' ^ Coil - formin S hydrophilic 
surface-foLng^^ regions beta-amphipathic regions, flexible regions, 

of the present m veS^ 32 C^SE^."*™ °< - P« e 

activities of the hyperimmune serum react,™ II ^erred re gl ons are those that mediate 

Most hishlv prefe red in «X ^^'T™™*™ ^ *««* <* *» P^ent invention. 

hyperimmune serum reac^ S^JKSS ^tTT* 1 ' 1 ^"^'^^ 
with a similar activity or an imnrS™n « ^ £ present mutton, including those 

preferred are fr^Zsc^Z^^t T * ' f""*" **** ParbCulari y 

viability of S. Xntcn^ iTlSTZ^T^ T 5 ™* 3 

fragments are mose that comprise ^ntam^nH " preferred Peptide 

especially in a human. ' anUgemc or immunogenic determinants in an animal, 

X^de^c^^ ^ d which is for itself antigenic or 

showing one or (for longer fra^S onTv ,^ 3180 «*8«b or antigenic fragments 

invention, provided ^JZe ^^^I^ZiZT^ "* With 

severely deteriorated on the SELw e Sr ^ ^ ^ add ^S 63 are ™* 
individual vaccinated with this3l«n S 311 a PP ro P™te immune response in an 

individual sera. ^ ldenbfied ^ * d ™dual antibody preparations from 

serum reactive antigen axe selected from the 
and "Location of k^S^2S^~ IT™ —genie aa", 

especially peptides comprising amino^dd ^^ ^^S ^ 
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145; 5-27, 32-64, 92-102, 107-113, 119-125, 133-139, 148-162, 177-187, 195-201, 207-214, 241-251, 254-269, 
285-300, 302-309, 317-324, 332-357, 365-404, 411-425, 443-463, 470-477, 479-487, 506-512, 515-520, 532-547, 
556-596, 603-610, 616-622, 624-629, 636-642, 646-665, 667-674, 687-692, 708-720, 734-739, 752-757' 798-820^ 
824-851, 856-865 and 732-763 of Seq ID No 146; 14-21, 36-44, 49-66, 102-127, 162-167, 177-196, 45-109 and 
145-172 of Seq ID No 147; 17-35, 64-75, 81-92, 100-119, 125-172, 174-183, 214-222, 230-236, 273-282, 287- 
303, 310-315, 331-340, 392-398, 412-420, 480-505, 515-523, 525-546, 553-575, 592-598, 603-609, 617-625, 631- 
639, 644-651, 658-670, 681-687, 691-704, 709-716, 731-736, 739-744, 750-763, 774-780, 784-791 799-805' 809- 
822, 859-870, 880-885, 907-916, 924-941, 943-949, 973-986, 1010-1016, 1026-1036, 1045-1054, 1057-1062 1082- 
1088, 1095-1102, 1109-1120, 1127-1134, 1140-1146, 1152-1159, 1169-1179, 1187-1196, 1243-1251 1262-1273 
1279-1292, 1306-1312, 1332-1343, 1348-1364, 1379-1390, 1412-1420, 1427-1436, 1458-1468, 1483^1503 1524^ 
1549, 1574-1588, 1614-1619, 1672-1685, 1697-1707, 1711-1720, 1738-1753, 1781-1787, 1796-1801, 1826-1843, 
132-478, 508-592 and 1753-1810 of Seq ID No 148; 15-43, 49-55, 71-77, 104-110, 123-130, 162-171 180-192 
199-205, 219-227, 246-254, 264-270, 279-287, 293-308, 312-322, 330-342, 349-356, 369-377, 384-394, 401-406 
416-422, 432-439, 450-460, 464-474, 482-494, 501-508, 521-529, 536-546, 553-558, 568-574, 584-591, 602-612 
616-626, 634-646, 653-660, 673-681, 688-698, 705-710, 720-726, 736-749, 833-848, 1-199, 200-337, 418-494 and 
549-647 of Seq ID No 149; 9-30, 65-96, 99-123, 170-178 and 1-128 of Seq ID No 150; 7-32, 34-41 96-106 
127-136, 154-163, 188-199, 207-238, 272-279, 306-312, 318-325, 341-347, 353-360, 387-393 399-406 434-44o' 
452-503, 575-580, 589-601, 615-620, 635-640, 654-660, 674-680, 696-701, 710-731, 1-548 and 660-691 of Seq" 
ID No 151; 4-19, 35-44, 48-59, 77-87, 93-99, 106-111, 130-138, 146-161 and 78-84 of Seq ID No 152; 24-30 
36-43, 64-86, 93-99, 106-130, 132-145, 148-165, 171-177, 189-220, 230-249, 251-263, 293-300, 302-312, 323-329 
338-356, 369-379, 390-412 and 179-193 of Seq ID No 153; 30-39, 61-67, 74-81, 90-120, 123-145, 154-167 169- 
179, 182-197, 200-206, 238-244, 267-272 and 230-265 of Seq ID No 154; 14-20, 49-65, 77-86 and 2-68 of Seq 
ID No 155; 4-9, 26-35, 42-48, 53-61, 63-85, 90-101, 105-111, 113-121, 129-137, 140-150, 179-188 199-226 228- 
237, 248-255, 259-285, 299-308, 314-331, 337-343, 353-364, 410-421, 436-442 and 110-144 of Seq ID No 156- 
36-47, 55-63, 94-108, 129-134, 144-158, 173-187, 196-206, 209-238, 251-266, 270-285, 290-295, 300-306 333^ 
344, 346-354, 366-397, 404-410, 422-435, 439-453, 466-473, 515-523, 529-543, 554-569, 571-585 590-596 607- 
618, 627-643, 690-696, 704-714, 720-728, 741-749, 752-767, 780-799, 225-247 and 480-507 of Seq ID No 157- 
16-25, 36-70, 80-93, 100-106 and 78-130 of Seq ID No 158; 18-27, 41-46, 50-57, 65-71, 79-85, 93-98 113-128 
144-155, 166-178, 181-188, 201-207, 242-262, 265-273, 281-295, 303-309, 318-327 and 36-64 of Seq ID No 159- 
7-29, 31-44, 50-59, 91-96, 146-153, 194-201, 207-212, 232-238, 264-278, 284-290, 296-302, 326-353, 360-37o' 
378-384, 400-405, 409-418, 420-435, 442-460, 499-506, 529-534, 556-562, 564-576, 644-651, 677-684, 687-698 
736-743, 759-766, 778-784, 808-814, 852-858, 874-896, 920-925, 929-935, 957-965, 1003-1012, 1021-1027 1030^ 
1044, 1081-1087, 1101-1111, 1116-1124, 1148-1159, 1188-1196, 1235-1251, 1288-1303, 1313-1319, 1328-1335 
1367-1373, 1431-1437, 1451-1458, 1479-1503, 1514-1521, 1530-1540, 1545-1552, 1561-1568, 1598-1605 1617- 
1647, 1658-1665, 1670-1676, 1679-1689, 1698-1704, 1707-1713, 1732-1738, 1744-1764, 1-70, 154-189 922-941 
1445-1462 and 1483-1496 of Seq ID No 160; 6-51, 81-91, 104-113, 126-137, 150-159, 164-174 197-209 215- 
224, 229-235, 256-269, 276-282, 307-313, 317-348, 351-357, 376-397, 418-437, 454-464, 485-490, 498-509^ 547- 
555, 574-586, 602-619 and 452-530 of Seq ID No 161; 25-31, 39-47, 49-56, 99-114, 121-127, 159-186 228-240 
253-269, 271-279, 303-315, 365-382, 395-405, 414-425, 438-453 and 289-384 of Seq ID No 162- 9-24 41-47 
49-54, 68-78, 108-114, 117-122, 132-140, 164-169, 179-186, 193-199, 206-213, 244-251, 267-274, 289-294 309- 
314, 327-333, 209-249 and 286-336 of Seq ID No 163; 9-28, 53-67, 69-82, 87-93, 109-117 172-177 201-207 
220-227, 242-247, 262-268, 305-318, 320-325 and 286-306 of Seq ID No 164; 4-10, 26-39, 47-58, 63-73 86-96 
98-108, 115-123, 137-143, 148-155, 160-176, 184-189, 194-204, 235-240, 254-259, 272-278 and 199-283of Seq 
ID No 165; 4-26, 33-39, 47-53, 59-65, 76-83, 91-97, 104-112, 118-137, 155-160, 167-174, 198-207, 242-268 273- 
279, 292-315, 320-332, 345-354, 358-367, 377-394, 403^10, 424-439, 445-451, 453-497, 511-518, 535-57o' 573- 
589, 592-601, 604-610 and 202-242 of Seq ID No 166; 8-30, 36-45, 64-71, 76-82, 97-103 105-112 134-151 
161-183, 211-234, 253-268, 270-276, 278-284, 297-305, 309-315, 357-362, 366-372, 375-384, 401-407' 409-416 
441-455, 463-470, 475-480, 490-497, 501-513, 524-537, 552-559, 565-576, 581-590, 592-600, 619-625' 636-644' 
646-656 and 316^19 of Seq ID No 167; 4-17, 52-58, 84-99, 102-110, 114-120, 124-135, 143-158, 16o'-173 177- 
196, 201-216, 223-250, 259-267, 269-275 and 1-67 of Seq ID No 168; 6-46, 57-67, 69-80 82-133 I37W 147- 
168, 182-187, 203-209, 214-229, 233-242, 246-280 and 53-93 of Seq ID No 169; 7-40, 50-56 81-89 117-123 
202-209, 213-218, 223-229, 248-261, 264-276, 281-288, 303-308, 313-324, 326-332, 340^346, 353-372 434-443 
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465-474, 514-523, 556-564, 605-616, 620-626, 631-636, 667-683, 685-699, 710-719, 726-732, 751-756 760-771 
779-788, 815-828, 855-867, 869-879, 897-902, 917-924, 926-931, 936-942, 981-1000, 1006-1015, 1017-1028 
1030-1039, 1046-1054, 1060-1066, 1083-1092, 1099-1112, 1122-1130, 1132-1140, 1148-1158 1161-1171 1174^ 
1181, 1209-1230, 1236-1244, 1248-1254, 1256-1267, 1269-1276, 1294-1299, 1316-1328, 1332-1354 1359-1372 
1374-1380, 1384-1390, 1395-1408, 1419-1425, 1434-1446, 1453-1460, 1465-1471, 1474-1493, 1505-1515 1523^ 
1537, 1547-1555, 1560-1567, 1577-1605, 1633-1651, 1226-1309, 1455-1536 and 1538-1605 of Seq ID No 170- 

4- 10, 31-39, 81-88, 106-112, 122-135, 152-158, 177-184, 191-197, 221-227, 230-246, 249-255, 303-311 317-326 
337-344, 346-362, 365-371, 430-437, 439-446, 453-462, 474-484 and 449-467 of Seq ID No 171- 9-15 24-35 
47-55, 122-128, 160-177, 188-196, 202-208, 216-228, 250-261, 272-303, 318-324, 327-339, 346-352 355-36l' 
368-373, 108-218 and 344-376 of Seq ID No 172; 6-14, 17-48, 55-63, 71-90, 99-109, 116-124, 181-189 212-223 
232-268, 270-294, 297-304, 319-325, 340-348, 351-370, 372-378, 388-394, 406-415, 421-434 and 177-277 of Sea 
ID No 173; 21-39, 42-61, 65-75, 79-85, 108-115 and 11-38 of Seq ID No 174; 4-17, 26-39, 61-76 103-113 115- 
122, 136-142, 158-192, 197-203, 208-214, 225-230, 237-251 and 207-225 of Seq ID No 175; 5-11, 27-36 42-53 
62-70, 74-93, 95-104, 114-119, 127-150, 153-159, 173-179, 184-193, 199-206, 222-241, 248-253 257-280 289- 
295, 313-319, 322-342, 349-365, 368-389, 393-406, 408-413, 426438, 447-461, 463470, 476495 532-537 543- 
550 and 225-246 of Seq ID No 176; 4-29, 68-82, 123-130, 141-147, 149-157, 178-191, 203-215' 269-277' 300- 
307, 327-335, 359-370, 374-380, 382-388, 393400, 410-417, 434442, 483492, 497-503, 505-513 533-540 564- 
569, 601-607, 639-647, 655-666, 693-706, 712-718, 726-736, 752-758, 763-771, 774-780, 786-799 806-812 820- 

^; 8 ^;^ 8 ^ 92 ' 901 " 909 ' 925 " 932 ' 943 " 948 ' "°- 996 ' 1030 - 1036 ' 1051 - 1059 ' ^-lOeS, 1079-1086, 1105- 
1113, 1152-1162, 1168-1179, 1183-1191, 1204-1210, 1234-1244, 1286-1295, 1318-1326, 1396-1401 1451-1460 

It^t Jf 7 - 1483 ' 1488 -1494, 1505-1510, 1514-1521, 1552-1565, 1593-1614, 1664-1672, 1677-1685, 1701- 
1711, 1734-1745, 1758-1770, 1784-1798, 1840-1847, 1852-1873, 1885-1891, 1906-1911, 193M939 1957-1970 
1977-1992, 2014-2020, 2026-2032, 2116-2134, 1-348, 373490, 573-767, 903-1043, 1155-1198, 1243-1482 1550 

IT'.lfZll 719 ' 1793 " 1921 ^ 2 ° 08 - 2110 ° f SC< 1 ID No 177 ' 10 - 35 ' 39 " 52 ' ™-l!2, 181-188, 226-236, 238- 
253, 258-268, 275-284, 296-310, 326-338, 345-368, 380-389, 391408, 410418, 420429, 444-456 489-505 573- 
588, 616-623, 637-643, 726-739, 741-767, 785-791, 793-803, 830-847, 867-881, 886-922, 949-956 961-98o' 988- 
1004, 1009-1018, 1027-1042, 1051-1069, 1076-1089, 1108-1115, 1123-1135, 1140-1151 IwS* iSlS 
1210-1221, 1223-1234, 1242-1250, 1255-1267, 1281-1292, 1301-1307, 13154340, 1348-1355 13664373 < S£' 
1413, 1417-1428, 1437-1444, 1453-1463, 1478-1484, 1490-1496, 1498-1503, 1520-1531 153845^ 1^S?0 
1593-1603, 1612-1625, 1635-1649, 1654-1660, 1670-1687, 1693-1700, 17054711, 171M^ J^ITO 
1813, 1871-1881, 1893-1900, 1907-1935, 1962-1970, 1992-2000, 2006-2013, 2033-2039, 2045-2051 2055-2067 
2070-2095, 2097-2110, 2115-2121, 2150-2171, 2174-2180, 2197-2202, 2206-2228 and 1526-^560 ofSeq^ No 
IU ti 'J^fln 5 ? 76 ' 78 " 107 ' 109 " 115 ' 118 - 127 ' 134 - 140 ' 145 - 156 ' 169 " 174 ' 217 -226, 232-240, 256-262, 267- 
353 " 36 °' 4 ° 2 - 409 ' 416 - 439 ' 448 ' 456 ' 506 - 531 ' 54 °- 546 ' 570-578, 586-593 595-600 623- 

SJtS TJ^ 713 " 724 ' 73 °- 74 °' 757 " 763 ' 773 " 778 ' 783 " 796 ' 829 " 835 ' 861 - 871 ' 88 8"899, 907- 

939, 941-955, 957-969, 986-1000, 1022-1028, 1036-1044, 1068-1084, 1095-1102, 1118-1124 1140-1146 1148- 
1154, 1168-1181, 1185-1190, 1197-1207, 1218-1226, 1250-1270, 1272-1281, 1284-1296, 

1383-1409, 1422-1428, 1438-1447, 1449-1461, 1482-1489, 1504-1510, 1518-1527, 15294537 1544-1551 l2 

Z\^T*£T]f 7 ' 1682 " 1689 ' mi ' 1718 ' 1733 - 1740 ' 1772 - 1783 ' l«8-1834 ?859^y aid 
llftll q N ° 179; 8 " 28 ' 32 " 37 ' 62 " 69 ' 119 " 125 ' 137 ' 149 ' 159 - 164 ' 200-205, 221-229, 240-245 

258-265, 268-276, 287-293, 296-302, 323-329 and 1-95 of Seq ID No 180; 9-18, 25-38, 49-63, 65-72 74-81 S 
117, 131-137, 139-146, 149-158, 162-188, 191-207, 217-225, 237-252, 255^69, 281-293, 30W26, 332^2 W- 
354, 363-370, 373-380, 391400, 415424, 441447 and 75-107 of Seq ID No 181- 4-24 64-51 81^87 96 116 

^£2°?f T 155 ' 166 " 173 - 176 " 184 ' 203 - 215 ' 231 - 238 - 243?248 ' 

and 67-148 of Seq ID No 182; 4-10, 19-37, 46-52, 62-81, 83-89, 115-120, 134-139, 141-151 168-186 197 205 
209-234, 241-252, 322-335, 339-345, 363-379, 385-393, 403431, 434442 447454 4594^ 47^84 
and 404420 of Seq ID No 183; 10-35, 46-66, 71-77, 84-93, 96-122, 138-148, 154-172, Sfl* 221-23^ 2£ 
6 SS? 295 " 301 ' 303 " 3 ° 9 ' 311 - 320 ' 324 ' 336 ' 340 - 348 ' 351 - 359 ' 375 "381 -d 111498 of Seq ro No lS 

J5f£ « '*f ^ o 67 " 75 ' 81 " 91 ' 98 " 106 ' 114 - 122 ' 124 " 135 ' 148 " 193 ' 209 " 227 - d 198 - 2 13 of Seq m No » 

5- 18, 45-50, 82-90, 97-114, 116-136, 153-161, 163-171, 212-219, 221-227, 240-249 267-281 311 317 V>1 15' 
375-381 390-395, 430436, 449455, 484495, 538-543, 548-554, '556-564, 59 ZmJ££5£5Z 
ID No 186; 9-25, 28-34, 3744, 61-68, 75-81, 88-96, 98-111, 119-133, 138-150, 152-163, 168-182 1^194^00 
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Sld 6 "^ 6 "^' ? 279 ' 287 ' 293 " 3 ° 4 ' 325 ^°' 353-358, 365-379, 399-409 444- 

Z S2 SStSS, * ^ 187; ^' ^ 66 - 76 ' 78 - 102 ' m - 130 ' i34 - 146 - ^ 22 

S;' ^ ri' J™!' 329 " 335 ' 348 " 362 ' 370-376, 379-384, 390-406, 420-429, 466-471, 479-489 495 504 529 

mam 

sammmmsm 

338, 357-363, 391-403 406^416 »d 360^8 m 2 196 " 209 ' 258 " 280 ' 32 *- 

339 357^69^83^ 395^' JSS 'f 188 ' 20 °- 213 ' 245 - 25 °' 266 " 272 ' 332- 

122-137 ^Vl^SS^^^"?™ of Seq ID No 207; 35-59, 7M6, 1U-U7. 

192, 2,3-221. 231-258 ,j££2£S ££££ 62 S^ltt™* ^ 

Of *, m No 210; 7-21. 30-48. 51-58, ^ ^^5^^^^ 
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SSJ' SIS' 217 235 ?l^* 2 ^ £ n Seq ID N ° ^ 69 " 76 ' 83 " 90 ' 123-^ 143-151, 

' ' 243_263 ' 303 - 309 ' 3 20 - 333 ' 338-348, 367-373, 387-393, 407-414 416-427 441 457 

473482, 487-499, 501-509, 514-520, 530-535, 577-583, 590-602, 605-612, 622-629 64^67 67wS^S^ 

339, 395-409, 414-420, 446-451, 475-487, 556-563, 568-575, 580-586, 588-595, 633-638 643-648 652-659 672 

5? SSStSSS ^ 761 - 767 - 775 - 781 - 796 - 806; £££ S 

915, 934-941 and 406-521 of Seq ID No 213; 9-18, 24-46, 51-58, 67-77, 85-108 114-126 129 137 139 

™^£%££-g£?£™' 2E£ 

. 522-527, 562-591, 599-606, 609-618, 625-631, 645-652 and 577-654 of Sea ID No 214- 13-20 
26-37, 41-53, 56-65, 81-100, 102-114, 118-127, 163-188, 196-202, 231-238, 245-25^266 2^ 301 ^ 

and 19-78 of Seq ID No 215; 10-23, 32-42, 54-66, 73-91, 106- 13, 118- 2 Ss TSt^S^'SL 
245, 284-300, 313-318, 330-337, 339-346, 354-361, 387-393, 404426, 429-439 441-2 46^473 47^' 22" 

S' S^SSSTTS 57 r 88 - 610 - 615 - 627 - 63s; ^ £S 2: 

7 S'3fJ l ' 48 °" 55 ° ° f Se< * ID No 216 ^ 6 - 39 - 42-50, 60-68, 76-83, 114-129 147-162 170-1R9 

I 17 '" 31 ' Z39 ' 248 ' 2 "' 305 ' 338 " 344 ' 352 - 357 ' 37 1" 377 < 380-451, 459-483 49^-499 507 523 537 559 

iu/1, xu/y iiwy, 1103-1127, 1151-1185, 1197-1261, 1269-1309, 1317-1333 1341-1349 iw msc -iazq 1E1 , 

ssjix it 1629 ' i637 - 1669 ' i677 - 1701 ' i7 ° 9 - 1725 ' isSiS ss^ss^rjs 

2025, 2029-2053, 2061-2109, 2117-2125, 2133-2183, 2195-2219, 2227-2271 2275-2299 2307 231 1 

££121 2395 " 2429 ' 2441 ' 2529 ' 2537 " 2569 ' 2577 - 2601 ' 2609126 ^ *£aS? 2^27^7 27^27^2^ 
2867, 2889-2913, 2921-2937, 2945-2969, 2977-2985, 2993-3009, 3023-3045, 30^-3099 S 316^ 3T75 w 

5-26, 3343, 48-54, 58-63, 78-83, 113-120, 122-128, 143-152, 157-175, 185-192 211-225 227 234 SlZra old 
281, 284-290, 304-310, 330-337, 348-355, 362-379, 384-394 429-445 J^SSmS 5^' S" 
554, 561-586, 590-604, 613-629, 149-186, 285-431 and 573^659 of Seq ID i££2£ 2£ fl 67 a3 9l" 

343 347-3^ U7 ' 182 ' 224 " 232 ' ^S 9 - 270 ' 3 «1- 3 10, 313-322, 332- 

343, 347-367, 384-398, 416-429, 431-446, 454461 and 1-169 of Seq ID No 222- 8-17 26-31 56^62 7* *k <n 

i 1 ; ^"f: 118 ; 124 ' 144 - 164 - 133-193, 197-204, 215-223, 227-236, 242-249 252 259 261 270 

SS^^l^if*?? 343 " 356 ' 385 " 392 ' 39MM * 40M17 ' 435 ' 441 ^ d 24 "50 of Seq ID No 226- 
™ f 1 " 61 ' 119 * 126 HU of Se 1 10 No 227; 5-17, 40-58, 71-83, 103-111 123-140 167 177 18^ 

TeTt^tlm 1f eq £™° "* *>' U - 50 ' 57 - 70 ' 112 " 123 ' 127 " 138 -d of s^ m Z»S 

I zu *^>' 271 "283, 285-293 and 139-155 of Seq ID No 231; 42-70 73-90 92-iriR 112 107 in 

« £2f S S S ^ ^ ^'^S SS: S 

*jo, 518-523, 531-536, 561-567, 576-589, 594-612, 620-631, 665-671 697-710 71 S 7*1 txc 7=/ ^= 
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298-308, 325-333, 345-355, 365-371, 382-395, 424^35, 441-457, 46*472, 483-491, 493-505, 528-534 536-546 
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of at least 10 amino acids SubssouTtfTZ, n™ P ? by 1 amino add with » le »S* 

mav be used to eonfLL 'SSSSS^KS** «~»™*« * *■ elopes 

pronuclei* acid ^"^T^ — ' - *~ ^ — 

SLKtrt^jS^S-i fc «*- a — * — — * - nucleic add 
invention and fte ^ucdcT^f h^2 m ^ *" S™*" 1 * *»> «*» * fte 

recombi^nttechruqSes "vpenmmune serum reactive antigens and fragments thereof by 

P-P^« r ^n Ml ia2dstoe^a^^ Hd ^^.Zr tor H 8 s d,able 10 m " in,ain - 
regard. In accordance with this asnett of ft. i™ °^ pep " ae m a nosl ma y be used for expression in this 

a single or double-sh-anX TsZ^7^ *T'?7 * ** eXamplei a P ,asmid 

plasmids disclosed herein Zf^Z^^, ° rdou " e ; s,IandadR NAor DMA viral vector. Starting 
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ZZtZl TT mamier t0 Pr ° dUCe gene P roduct ^ me recombinant sequence 

f^rrjnl 7 ' ^P™""™ serum re ^ve antigens and fragments thereof of the invention^ be 
SantS 7 T vT C ° nVentional P e P tide synthesizer,. Mature proteins can be express^ fa 

manunahan cells, yeast, bacteria, or other cells under the control of appropriate promoters SSree^ 

«sr pM to produce - -w* 

moS^ 31 be gCnetically peered to incorporate nucleic acid molecules and express nucleic acid 
Z^lLtZ PieS& T 7T° n - Re P resentative —Pi- of appropriate hosts include bacterid ceSs 
ZTstZT^? ' T PlW ? COCd ' R Stre Ptom V ces and Bacillus subtillis cells; fungal cells sucTS 

such as CHO, COS, Hela, C127, 3T3, BHK, 293 and Bowes melanoma cells; and plant cells. 
^t Ven H° n / S ° Pr °^ deS 3 PTOCeSS f ° r P rodudn S * S. pneumoniae hyperimmune serum reactive 

oofvT\ hT^- ' Particularl y **** acids, may be added to the N- or C-te^ius of X 

deteaxm „f .he orgara8m ot o^us^ m a sample OTntaining ^ OTglmisms ^ 
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a diagnostic -^taSlS ^rmmabon of normal and abnormal levels. Thus, for Stance, 

compSed to normal SEST^TES -ed tTf * *" 

example, and to identify the inf * 7 t0 deteCt ** P resence of ^ infection, for 

a polypeptide m a sampt d^^l 7 techni ^ es *«* ca * be used to determine levels of 

methoTmdude rSS^^^^ m ^ ^ 4 ° *~ ° f skiU * «* Such assay 
assay, Among ^ ELl^^X^^^^ 8 ' Western Blot analysis and ELIsI 
antibody specific to the DolvoerSp , ^ 3SS3y ""^ comprises preparing an 
generally is prepared iSl^oT^ 7 " * ^ 3 re P orter -« b W 
detectable regent such Radioactive fluZTl ^ ^ "P 0 * is attached to a 

enzyme. radioactive, fluorescent or enzymatic reagent, such as horseradish peroxidase 

aiob^ut^ 

hyperimmune seru^Sve ^er^ aTfra™ ZV" T*' ^ at one of the 

immobilized on a support 7aS^ suoTort 12 * «° pr6Sent mVention ma 7 b * 

antigens and fragmen^f Xt&b.' =S b t *? dBB ~ — ^ 
hyperimmune serum reactive antieens aLX.IT \ ?T 7 c by USmg ° ne or several of me 

hyperimmune serum reactive SSI ™ d £ST? ?~f t0 *" P resent mvention «d/or 

of such array as^ll asTanC S Z bem S The characterizing feature 

position on srid^po* of ZZZ ?<£ZT2 « ? f 34 3 dfaUncl ° r P redefined or 
activity at a distinS position oTreSon of a P^PepUde is immobilized. Because of mis any 

number of different KS^S Z ^ * COrTelated ^* 3 S P edfic Polypeptide. The 
support may rangTfrom T^^IZJ? ^J^™* ***** immobiLd on a 

and fragments thereof Th^nJL 7u °° dlfferent h >Terimmune serum reactive antigens 

P^es/po^ndes^ *> 3t ^ «0 ~ 

antigens and fragments thereof per cm^. ^ hyperunmune serum reactive 

X^^^aylSel^^ ~, ^ * *" ** ~ * de ^ * » 

least a first surface ThlZ^w 7 C ° mpnSeS a P lanar ' P orous or non-porous solid support having at 

are ^^Z^Z^TZZT T ^ *~* 38 d ^ 2£ 

also within the pr^l^onS "Tf* 15 ^ S lass or ceUulose - » * 

herein. Apart from the ^2^1^^^ d^ deSCribed 
present invention also the nucleTa^d Zl^.f ? L fra 8ments thereof according to the 

generation of an array T^ J"L™^Z T § *S * PMert hw « tfan may be I far the 
-nodonalantibodiias^gl^ 

invention. The present tow£KS!2T f ^ ° F according to the present 
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hbrary It is within the present invention that the antibody may be chimeric, i. e. that different parts 
thereof stem from different species or at least the respective sequences are taken from different species. 

Antibodies generated against the hyperimmune serum reactive antigens and fragments thereof 
corresponding to a sequence of the present invention can be obtained by direct injection of the 
hyperimmune serum reactive antigens and fragments thereof into an animal or by administering the 
hypenmmune serum reactive antigens and fragments thereof to an animal, preferably a non-human The 
antibody so obtamed will then bind the hyperimmune serum reactive antigens and fragments thereof 
itself. In this manner, even a sequence encoding only a fragment of a hyperimmune serum reactive 
antigen and fragments thereof can be used to generate antibodies binding the whole native hyperimmune 
serum reactive antigen and fragments thereof. Such antibodies can then be used to isolate the 
hyperimmune serum reactive antigens and fragments thereof from tissue expressing those hyperimmune 
serum reactive antigens and fragments thereof. yF 

For preparation of monoclonal antibodies, any technique known in the art, which provides antibodies 
produced by continuous cell line cultures can be used (as described originally in {Kohler, G. et al., 1975}. 

Techniques described for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be 

^f^ 0 *** to inununogenic hyperimmune serum reactive antigens and 
fragments thereof according to this invention. Also, transgenic mice, or other organisms such as other 

Z^Zt:^ US f !u ^f 58 humanized ^bodies to immunogenic hyperimmune serum reactive 
antigens and fragments thereof according to this invention. 

t!n?T ly ' P i aS ! diSplay * echnol °gy or ribos °™l d«Play could be utilized to select antibody genes 
with binding activities towards the hyperimmune serum reactive antigens and fragments thereof either 

^neZf^ T / "2 >Med V_geneS ° f fr ™ h-ians screwed for possessing 

respective target antigens or from naive libraries {McCafferty, J. et al., 1990}; {Marks, J. et al., 1992}. The 
affinity of these antibodies can also be improved by chain shuffling {Clackson, T. et al., 1991}. 

^25^E5^ ™ PrGSent ' 6aCh ^ ^ * dk6Cted ^ a **~ e P ito P e " 

nv^ e ^T deSCribed antib ° dieS ^ ^ empI ° yed t0 iS ° late OT to identif y dones expressing the 
hyperimmune serum reactive antigens and fragments thereof or purify the hyperimmune serum reactive 
antigens and fragments hereof of the present invention by attachment of the'Libody to a solid support 
for isolation and/or purification by affinity chromatography. PP 

Thus among others, antibodies against the hyperimmune serum reactive antigens and fragments thereof 
of the present invention may be employed to inhibit and/or treat infectfons, particularly bacS 
infections and especially infections arising from S. pneumoniae. 

Hyperimmune serum reactive antigens and fragments thereof include antigenically, epitopically or 
7 deriVatives ' which fo ™ « particular aspect of this invLion. The term 

antigenically equivalent derivative" as used herein encompasses a hyperimmune serum reactive antigen 

when TT? t ° r . itS 6qUiValent WMch WiU be s P edficaU y cognized by certain antibodies wS 
when raised to the protein or hyperimmune serum reactive antigen and fragments thereof accordlg to 

? u23oZn ^ With * e MteraCti ° n betWeen P ath °g- and mammalian host. The ttZ 

immunologically equivalent derivative" as used herein encompasses a peptide or its equivalent whi>h 
when used m a suitable formulation to raise antibodies in a verteLte, the^tibodies LTmteX ^£ 
the mteracfaon between pathogen and mammalian host s act to interfere with 

The hyperimmune serum reactive antigens and fragments thereof, such as an antigenically or 
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immunologically equivalent derivative or a fusion protein thereof can be used as an antigen to immunize 

hv^n° r " ntaal " 3 m ° r ^ ™ e P-tein may provide ^bUnTto^ 

hypenmmune serum reactive antigens and fragments thereof. The antigen may be associated for 
example by conjugation, with an immunogenic carrier protein, for example bovine serum amum^S^ 
or keyhole hmpet haemocyanin (KLH). Alternatively an antigenic peptide comprisu^TuS^pS o 
n^unoT m °n ^^T 1 " 16 SGrUm reaCtiVe and fragments thereof or L ^tig^y or 

in vivo mfechon usmg cloned retroviral vectors {Seeger, C. et al v 1984} * 

IpflllllEilil 

ES^ST ^acfcrisation prefenrbry comprises the 

peptides taC^Sto"^^:,, « VT" " 6 "° t Um " ed to lm 8** h °"«™*- P^«*ly 
methods ™Tf~ oTf^ I ? " mm ° addS 316 P"*""? obtained in the Lpecdve 

ho^t; ^S^t may * abou ' 101 ,o 10 " W *° ™" Afferent Pi«** 



tSd^SLi^,^ 8 "T senan leacave — thereof are fc 

called andcahnes wh.Aaie, among others, described in Gennan patent appltotionDE 19742 706 

^TL^^t^n nud ^ c adds •» * ^ 

invennon JZZTJTZ ZTJZZI ^ — *-» * *■ 



invention »d the basic ateps are .mown to^ idTZLb^^^aS 



are 
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preferably aptamers and spiegelmers. 

Aptamers are D-nucleic adds, which are either single stranded or double stranded and which specifically 
interact with a target molecule. The manufacture or selection of aptamers is, e.g. described in European 
patent EP 0 533 838. Basically the following steps are realised. First, a nuxtule of nucleic ac^T 
potential aptamers, is provided whereby each nucleic acid typically comprises a segment of several 
preferably at least eight subsequent randomised nucleotides. This mixture is subsequently contacted with 
the target molecule whereby the nucleic acid( S ) bind to the target molecule, such as based on an increased 
affinity towards the target or with a bigger force thereto, compared to the candidate mixture. The binding 
nucleic acid(s) are/is subsequently separated from the remainder of the mixture. Optionally, the thus 
obtained nucleic acid(s) is amplified using, e.g. polymerase chain reaction. These steps may be repeated 
several tunes giving at the end a mixture having an increased ratio of nucleic acids specifically binding to 
*e target from which the final binding nucleic acid is then optionally selected. These specifically binlng 

or fd^«S ( ? ^ t0 38 aptamerS ' 14 iS ° bvioUS ** at sta S e of »■ me*** for the generation 

or identification of the aptamers samples of the mixture of individual nucleic acids may be taken to 
determine the sequence thereof using standard techniques. It is within the present invention that the 
S™ 8 ^ ^ **' 6 - 8 " by Cueing defined chemical groups which are known to 

the one skilled m the art of generating aptamers. Such modification may for example reside in the 

Tr^lTZZ ^r 1 ™ a ^ U f 3t *" 2 '-P° sition of *h e «8« moiety of the nucleotides. Aptamers are 
currently used as therapeutical agents. However, it is also within the present invention that the thus 

SvSotmlf^H aPt T rS r 7 ^ 6 ™* d f ° r *** VaUdati0n mAlot 38 lead ^^tance for the 
development of medicaments, preferably of medicaments based on small molecules. This is actually done 

\ a uZ X Whereb y me interaction between the target molecule and the aptamer is 

inhibited by a candidate drug whereby upon replacement of the aptamer from the complex of target and 
aptamer it may be assumed that the respective drug candidate allows a specific inhibition of the 
interaction between target and aptamer, and if the interaction is specific, said candidate drug will, at least 
in principle, be suitable to block the target and thus decrease its biological availability or activity in a 
respective system comprising such target. The thus obtained small molecule may then be subject to 
fiuther denvatisation and modification to optimise its physical, chemical, biological and/or medical 
characteristics such as toxicity, specificity, biodegradability and bioavailability. 

Spiegelmers and their generation or manufacture is based on a similar principle. The manufacture of 
spiegelmers is described in international patent application WO 98/08856. Spiegelmers are L-nucleic 
aads, which means that they are composed of ^nucleotides rather than D-nudeotides as aptamers are. 
comSrr ^ aCteiized ^ J* fact ** *ey have a very high stability in biological systems and, 
comparable to aptamers, specifically interact with the target molecule against which they are directed In 
fte process of generating spiegelmers, a heterogeonous population of D-nudeic adds is created and mis 
population is contacted with the optical antipode of the target molecule, in the present case for example 
with the D-enanuomer of the naturally occurring L-enantiomer of the hyperimmune serum reactive 
antigens and fragments thereof according to the present invention. Subsequently, those D-nucleic adds 
are separated which do not interact with the optical antipode of the target molecule. But those r>nudeic 
£1 ^ ^g wxm the op^ mapodB of Ae ^ molecule m ^ 
nud£ T -d subsequently the corresponding L-nudeic adds are synthesized based on the 
iSL^f tZH mf0nnati0n ? *f ed from *• »««Wc adds. These L^udeic adds which are 
identical in terms of sequence with the aforementioned D-nudeic adds interacting with the optical 
antipode of the target molecule, will specifically interact with the naturally occurr^^argefmoCe 
rather than with the optical antipode thereof. Similar to the method for me generation oTaltm^s^ s 

mte T 1 VariOUS St6pS S6VeraI md *"» to «** ^e nudeic add^dfic^ly 

mteracting with the optical antipode of the target molecule. specmcauy 

JTudet^n ZIZT Pr6Se ? inVCnti0n relat6S *° funCti0nal nucIeic ^ds interacting with any of the 
nudeic acid molecules according to the present invention, and a method for the manufacture of sucn 
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functional nucleic acids whereby the method is characterized by the use of the nucleic acid molecules and 
their respective sequences according to the present invention and the basic steps are known to the one 
skilled m the art. The functional nucleic acids are preferably ribozymes, antisense oligonucleotides and 
siRNA. 



Ribozymes are catalytically active nucleic acids which preferably consist of RNA which basically 
comprises two moieties. The first moiety shows a catalytic activity whereas the second moiety is 
responsible for the specific interaction with the target nucleic acid, in the present case the nucleic add 
coding for the hyperimmune serum reactive antigens and fragments thereof according to the present 
invention Upon interaction between the target nucleic acid and the second moiety of the ribozyme, 
typically by hybridisation and Watson-Crick base pairing of essentially complementary stretches of bases 
on ihe two hybridising strands, the catalytically active moiety may become active which means that it 
catalyses, either intramolecularly or intermolecularly, the target nucleic acid in case the catalytic activity 
of the ribozyme is a phosphodiesterase activity. Subsequendy, there may be a further degradation of the 
target nucleic acid, which in the end results in the degradation of the target nucleic add as well as the 
protein derived from the said target nudeic add. Ribozymes, their use and design prindples are known 
to the one skilled in the art, and, for example described in {Doherty, E. et al., 2001} and {Lewin, A. et al.. 

The activity and design of antisense oligonudeotides for the manufacture of a medicament and as a 
diagnostic agent, respectively, is based on a similar mode of action. Basically, antisense oligonudeotides 
hybridise based on base complementarity, with a target RNA, preferably with a mRNA, thereby 
activating RNase H. RNase H is activated by both phosphodiester and phosphorothioate-coupled DNA 
Phosphodiester-coupled DNA, however, is rapidly degraded by cellular nucleases with the exception of 
phosphorothioate-coupled DNA. These resistant, non-naturally occurring DNA derivatives do not inhibit 
SJT^t J rfdisation with RNA " fo other words, antisense polynudeotides are only effective as 
DNA RNA hybnde complexes. Examples for this kind of antisense oligonucleotides are described 
among others, in US-patent US 5,849,902 and US 5,989,912. In other words, based on the nudeic add 
sequence of the target molecule which in the present case are the nucleic add molecules for the 
hyperimmune serum reactive antigens and fragments thereof according to the present invention, either 
from the target protein from which a respective nucleic add sequence may in principle be deduced, or by 
knowing the nucleic add sequence as such, particularly the mRNA, suitable antisense oligonucleotides 
may be designed base on the principle of base complementarity. 

Vj rf T d . ^^^udeotides, which have a short stretch of phosphorothioate 
DNA (3 to 9 bases). A minimum of 3 DNA bases is required for activation of bacterial RNase H and a 
minimum of 5 bases is required for mammalian RNase H activation. In these chimeric oligonudeotides 
there is a central region that forms a substrate for RNase H that is flanked by hybridising "arms" 
compnsed of modified nucleotides that do not form substrates for RNase H. The hybridising arms of the 
chimeric oligonucleotides may be modified such as by 2'-0-methyl or 2'-fluoro. Alternative approaches 
used methylphosphonate or phosphoramidate linkages in said arms. Further embodiments of the 
Zr«Tv ° h f° nUCl ? 0tid " USe ^ * ** P ractice of *e present invention are P-methoxyoligonudeotides, 
partial P-methoxyohgodeoxyribonudeotides or P-methoxyoligonucleotides. 

Of particular relevance and usefulness for the present invention are those antisense oligonudeotides as 
more particularly described in the above two mentioned US patents. These oligonudeotides contain no 
nativity occurring 5;-»3'-linked nucleotides. Rather the oligonucleotides have two types of nucleotides- 
2 -deoxyphosphorothioate, which activate RNase H, and 2'-modified nucleotides, which do not The 
Sr h b rr * e 5' m0dified nUde ° tideS C3n be Phosphodiesters, pho^horothioate or p! 
re^oTwh^ 0 ^ K Activation of «N« H is accomplished by a contiguous RNase H-acti^ating 
H^ndh^t 1 TT, nTT 3 ^ 5 ^--yPhosphorothioate nudeotides to activate bacterial RnTs! 
H and between 5 and 10 2'- deoxyphosphorothioate nucleotides to activate eucaryotic and, particularly 
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mammalian RNase H. Protection from degradation is accomplished by making the 5' and 3' terminal 
bases highly nuclease resistant and, optionally, by placing a 3' terminal blocking group. 

More particularly, the antisense oligonucleotide comprises a 5' terminus and a 3' terminus- and from 
position 11 to 59 5'-»3'-linked nucleotides independently selected from the group consisting of 2'- 
modified phosphodiester nucleotides and 2'-modified P-alkyloxyphosphotriester nucleotides- and 
wherein the 5'-terminal nucleoside is attached to an RNase H-activating region of between three and ten 
contiguous phosphorothioate-linked deoxyribonucleondes, and wherein the 3'-terminus of said 
oligonucleotide is selected from the group consisting of an inverted deoxyribonucleotide, a contiguous 
stretch of one to three phosphorothioate 2'-modified ribonucleotides, a biotin group and a P- 
alkyloxyphosphotriester nucleotide. 

Also an antisense oligonucleotide may be used wherein not the 5' terminal nucleoside is attached to an 
RNase H-activating region but the 3' terminal nucleoside as specified above. Also, the 5' terminus is 
selected from the particular group rather than the 3' terminus of said oligonucleotide. 

The nucleic acids as well as the hyperimmune serum reactive antigens and fragments thereof according 
to the present invention may be used as or for the manufacture of pharmaceutical compositions! 
especially vaccines. Preferably such pharmaceutical composition, preferably vaccine is for the prevention 
or treatment of diseases caused by, related to or associated with S. pneumoniae. In so far another aspect of 
the mvention relates to a method for inducing an immunological response in an individual, particularly a 
mammal, which comprises inoculating the individual with the hyperimmune serum reactive antigens 
and fragments thereof of the invention, or a fragment or variant thereof, adequate to produce antibodies 
to protect said individual from infection, particularly streptococcal infection and most particularly S 
pneumoniae infections. y ' 

Yet another aspect of the invention relates to a method of inducing an immunological response in an 
individual which comprises, through gene therapy or otherwise, delivering a nucleic acid functionally 
encoding hyperimmune serum reactive antigens and fragments thereof, or a fragment or a variant 
thereof, for expressing the hyperimmune serum reactive antigens and fragments thereof, or a fragment or 
a variant thereof in vivo in order to induce an immunological response to produce antibodies or a cell 
mediated T cell response, either cytokine-producing T cells or cytotoxic T cells, to protect said individual 
from disease, whether that disease is already established within the individual or not. One way of 
administering the gene is by accelerating it into the desired cells as a coating on particles or otherwise. 

A further aspect of the invention relates to an immunological composition which, when introduced into a 
host capable of having induced within it an immunological response, induces an immunological response 
m such host, wherein the composition comprises recombinant DNA which codes for and expresses an 
antigen of the hyperimmune serum reactive antigens and fragments thereof of the present invention. The 
mimunological response may be used therapeutically or prophylactically and may take the form of 
antibody immunity or cellular immunity such as that arising from CTL or CD4+ T cells. 

The hyperimmune serum reactive antigens and fragments thereof of the invention or a fragment thereof 
may be fused with a co-protein which may not by itself produce antibodies, but is capable of stabilizing 
the first protein and producing a fused protein which will have immunogenic and protective properties 

/r^" 311 *^ 0 ^ P feferabl y comprises an antigenic co-protein, such as 

Glutamione-S-transferase (GST) or beta-galactosidase, relatively large co-proteins which solubilise the 
protein and facilitate production and purification thereof. Moreover, the co-protein may act as an 
adjuvant m me sense of providing a generalized stimulation of the immune system. The co-protein may 
be attached to either the amino or carboxy terminus of the first protein. 

Also, provided by this invention are methods using the described nucleic acid molecule or particular 



WO 2004/092209 



PCT/EP2004/003984 



-38- 

fragments thereof in such genetic immunization experiments in animal models of infection with S. 
pneumoniae. Such fragments will be particularly useful for identifying protein epitopes able to provoke a 
prophylactic or therapeutic immune response. This approach can allow for the subsequent preparation of 
monoclonal antibodies of particular value from the requisite organ of the animal successfully resisting or 
clearing infection for the development of prophylactic agents or therapeutic treatments of S. pneumoniae 
infection in mammals, particularly humans. 

The hyperimmune serum reactive antigens and fragments thereof may be used as an antigen for 
vaccination of a host to produce specific antibodies which protect against invasion of bacteria, for 
example by blocking adherence of bacteria to damaged tissue. Examples of tissue damage include 
wounds in skin or connective tissue and mucosal tissues caused e.g. by viral infection (esp. respiratory, 
such as the flu) mechanical, chemical or thermal damage or by implantation of indwelling devices, or 
wounds m the mucous membranes, such as the mouth, mammary glands, urethra or vagina. 

The present invention also includes a vaccine formulation, which comprises the immunogenic 
recombinant protein together with a suitable carrier. Since the protein may be broken down in the 
stomach, it is preferably administered parenterally, including, for example, administration that is 
subcutaneous, intramuscular, intravenous, intradermal intranasal or transdermal. Formulations suitable 
for parenteral adrninistration include aqueous and non-aqueous sterile injection solutions which may 
contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation isotonic with the 
bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous sterile suspensions 
which may include suspending agents or thickening agents. The formulations may be presented in unit- 
dose or multi-dose containers, for example, sealed ampoules and vials, and may be stored in a freeze- 
dried condition requiring only the addition of the sterile liquid carrier immediately prior to use. The 
vaccine formulation may also include adjuvant systems for enhancing the immunogenicity of the 
formulation, such as oil-in-water systems and other systems known in the art. The dosage will depend on 
the specific activity of the vaccine and can be readily determined by routine experimentation. 

According to another aspect, the present invention relates to a pharmaceutical composition comprising 
such a hyperimmune serum-reactive antigen or a fragment thereof as provided in the present invention 
for S. pneumoniae. Such a pharmaceutical composition may comprise one, preferably at least two or more 
hyperimmune serum reactive antigens or fragments thereof against S. pneumoniae. Optionally, such S 
pneumoniae hyperimmune serum reactive antigens or fragments thereof may also be combined with 
antigens against other pathogens in a combination pharmaceutical composition. Preferably, said 
pharmaceutical composition is a vaccine for preventing or treating an infection caused by S. pneumoniae 
and/or other pathogens against which the antigens have been included in the vaccine. 

According to a further aspect, the present invention relates to a pharmaceutical composition comprising a 
nucleic acid molecule encoding a hyperimmune serum-reactive antigen or a fragment thereof as 
identified above for S. pneumoniae. Such a pharmaceutical composition may comprise one or more nucleic 
acid molecules encoding hyperimmune serum reactive antigens or fragments thereof against S 
pneumoniae. Optionally, such S. pneumoniae nucleic acid molecules encoding hyperimmune serum 
reactive antigens or fragments thereof may also be combined with nucleic acid molecules encoding 
antigens against other pathogens in a combination pharmaceutical composition. Preferably said 
pharmaceutical composition is a vaccine for preventing or treating an infection caused by S. pneumoniae 
and/or other pathogens against which the antigens have been included in the vaccine. 

The pharmaceutical composition may contain any suitable auxiliary substances, such as buffer 
substances, stabilisers or further active ingredients, especially ingredients known in connection of 
pnarmaceuncal composition and/or vaccine production. 

A preferable carrier/or excipient for the hyperimmune serum-reactive antigens, fragments thereof or a 
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coding nucleic acid molecule thereof according to the present invention is an immunostimulatory 
compound for further stimulating the immune response to the given hyperimmune serum-reactive 
antigen, fragment thereof or a coding nucleic acid molecule thereof. Preferably the immunostimulatory 
compound in the pharmaceutical preparation according to the present invention is selected from the 
group of polycationic substances, especially polycationic peptides, immunostimulatory nucleic acids 
molecules, preferably immunostimulatory deoxynudeotides, alum, Freund's complete adjuvants, 
Freund's incomplete adjuvants, neuroactive compounds, especially human growth hormone, or 
combinations thereof. 



It is also within the scope of the present invention that the pharmaceutical composition, especially 
vaccine, comprises apart from the hyperimmune serum reactive antigens, fragments thereof and/or 
coding nucleic acid molecules thereof according to the present invention other compounds which are 
biologically or pharmaceutically active. Preferably, the vaccine composition comprises at least one 
polycationic peptide. The polycationic compound(s) to be used according to the present invention may be 
any polycationic compound, which shows the characteristic effects according to the WO 97/30721. 
Preferred polycationic compounds are selected from basic polyppetides, organic polycations, basic 
polyamino acids or mixtures thereof. These polyamino acids should have a chain length of at least 4 
amino acid residues (WO 97/30721). Especially preferred are substances like polylysine, polyarginine and 
polypeptides containing more than 20 %, especially more than 50 % of basic amino acids in a range of 
more than 8, especially more than 20, amino acid residues or mixtures thereof. Other preferred 
polycations and their pharmaceutical compositions are described in WO 97/30721 (eg 
polyethyleneimine) and WO 99/38528. Preferably these polypeptides contain between 20 and 500 amino 
acid residues, especially between 30 and 200 residues. 



These polycationic compounds may be produced chemically or recombinantly or may be derived from 
natural sources. 



Canonic (polypeptides may also be anti-microbial with properties as reviewed in {Ganz, T., 1999}. These 
(polypeptides may be of prokaryotic or animal or plant origin or may be produced chemically or 
recombinantly (WO 02/13857). Peptides may also belong to the class of defensins (WO 02/13857). 
Sequences of such peptides can be, for example, found in the Antimicrobial Sequences Database under 
the following internet address: 



http://ww w.bbcm.univ.trieste.it/--tossi/pag2.html 

Such host defence peptides or defensives are also a preferred form of the polycationic polymer according 
to the present invention. Generally, a compound allowing as an end product activation (or down- 
regulation) of the adaptive immune system, preferably mediated by APCs (including dendritic cells) is 
used as polycationic polymer. 

Especially preferred for use as polycationic substances in the present invention are cathelicidin derived 
antimicrobial peptides or derivatives thereof (International patent application WO 02/13857, incorporated 
herein by reference), especially antimicrobial peptides derived from mammalian cathelicidin, preferably 
from human, bovine or mouse. 

Polycationic compounds derived from natural sources include HIV-REV or HTV-TAT (derived cationic 
peptides, antennapedia peptides, chitosan or other derivatives of chitin) or other peptides derived from 
these peptides or proteins by biochemical or recombinant production. Other preferred polycationic 
compounds are cathelin or related or derived substances from cathelin. For example, mouse cathelin is a 
peptide which has the amino acid sequence NH2-RLAGLLRKGGEB3GEKLKKIGOKIKNFFQKLVPQPE- 
COOH. Related or derived cathelin substances contain the whole or parts of the cathelin sequence with at 
least 15-20 ammo acid residues. Derivations may include the substitution or modification of the natural 



WO 2004/092209 



PCT/EP2004/003984 



-40- 

amino acids by amino acids which are not among the 20 standard amino acids. Moreover, further cationic 
residues may be introduced into such cathelin molecules. These cathelin molecules are preferred to be 
combined with the antigen. These cathelin molecules surprisingly have turned out to be also effective as 
an adjuvant for an antigen without the addition of further adjuvants. It is therefore possible to use such 
cathelin molecules as efficient adjuvants in vaccine formulations with or without further 
immunactivating substances. 

Another preferred polycationic substance to be used according to the present invention is a synthetic 
peptide containing at least 2 KLK-motifs separated by a linker of 3 to 7 hydrophobic amino acids 
(International patent application WO 02/32451, incorporated herein by reference). 

The pharmaceutical composition of the present invention may further comprise immunostimulatory 
nucleic acid(s). Immunostimulatory nucleic acids are e. g. neutral or artificial CpG containing nucleic 
acids, short stretches of nucleic acids derived from non-vertebrates or in form of short oligonucleotides 
(ODNs) containing non-methylated cytosine-guanine di-nucleotides (CpG) in a certain base context (e.g. 
described in WO 96/02555). Alternatively, also nucleic acids based on inosine and cytidine as e.g. 
described in the WO 01/93903, or deoxynucleic acids containing deoxy-inosine and/or deoxyuridine 
residues (described in WO 01/93905 and PCT/EP 02/05448, incorporated herein by reference) may 
preferably be used as immunostimulatory nucleic acids for the present invention. Preferablly, the 
mixtures of different immunostimulatory nucleic acids may be used according to the present invention. 

It is also within the present invention that any of the aforementioned polycationic compounds is 
combined with any of the immunostimulatory nucleic acids as aforementioned. Preferably, such 
combinations are according to the ones as described in WO 01/93905, WO 02/32451, WO 01/54720 WO 
01/93903, WO 02/13857 and PCT/EP 02/05448 and the Austrian patent application A 1924/2001, 
incorporated herein by reference. 

In addition or alternatively such vaccine composition may comprise apart from the hyperimmune serum 
reactive antigens and fragments thereof, and the coding nucleic acid molecules thereof according to the 
present invention a neuroactive compound. Preferably, the neuroactive compound is human growth 
factor as, e.g. described in WO 01/24822. Also preferably, the neuroactive compound is combined with 
any of the polycationic compounds and/or immunostimulatory nucleic acids as afore-mentioned. 

In a further aspect the present invention is related to a pharmaceutical composition. Such pharmaceutical 
composition is, for example, the vaccine described herein. Also a pharmaceutical composition is a 
pharmaceutical composition which comprises any of the following compounds or combinations thereof: 
the nucleic acid molecules according to the present invention, the hyperimmune serum reactive antigens 
and fragments thereof according to the present invention, the vector according to the present invention, 
the cells according to the present invention, the antibody according to the present invention, the 
functional nucleic acids according to the present invention and the binding peptides such as the 
anticalines according to the present invention, any agonists and antagonists screened as described herein. 
In connection therewith any of these compounds may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical carrier suitable 
for administration to a subject. Such compositions comprise, for instance, a media additive or a 
therapeutically effective amount of a hyperimmune serum reactive antigen and fragments thereof of the 
invention and a pharmaceutically acceptable carrier or excipient. Such carriers may include, but are not 
limited to, saline, buffered saline, dextrose, water, glycerol, ethanol and combinations thereof. The 
formulation should suit the mode of acuiuhistration. 

The pharmaceutical compositions may be aclministered in any effective, convenient manner including, 
for instance, administration by topical, oral, anal, vaginal, intravenous, intraperitoneal, intramuscular 
subcutaneous, intranasal, intratracheal or intradermal routes among others. 
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In therapy or as a prophylactic, the active agent may be administered to an individual as an injectable 
composition, for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application, for example in the form of 
ointments, creams, lotions, eye ointments, eye drops, ear drops, mouthwash, impregnated dressings and 
sutures and aerosols, and may contain appropriate conventional additives, including, for example, 
preservatives, solvents to assist drug penetration, and emollients in ointments and creams. Such topical 
formulations may also contain compatible conventional carriers, for example cream or ointment bases, 
and ethanol or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98 % by 
weight of the formulation; more usually they will constitute up to about 80 % by weight of the 
formulation. 



In addition to the therapy described above, the compositions of this invention may be used generally as a 
wound treatment agent to prevent adhesion of bacteria to matrix proteins exposed in wound tissue and 
for prophylactic use in dental treatment as an alternative to, or in conjunction with, antibiotic 
prophylaxis. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be employed to 
enhance the immune response. A suitable unit dose for vaccination is 0.05-5 ug antigen / per kg of body 
weight, and such dose is preferably administered 1-3 times and with an interval of 1-3 weeks. 

With the indicated dose range, no adverse toxicologkal effects should be observed with the compounds 
of the invention, which would preclude their administration to suitable individuals. 

In a further embodiment the present invention relates to diagnostic and pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. The ingredient(s) can be present in a useful amount, dosage, formulation 
or combination. Associated with such container(s) can be a notice in the form prescribed by a 
governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products 
reflecting approval by the agency of the manufacture, use or sale of the product for human" 
administration. 



In connection with the present invention any disease related use as disclosed herein such as, e. g. use of 
the pharmaceutical composition or vaccine, is particularly a disease or diseased condition which is 
caused by, linked or associated with Streptococci, more preferably, S. pneumoniae. In connection therewith 
it is to be noted that S. pneumoniae comprises several strains including those disclosed herein. A disease 
related, caused or associated with the bacterial infection to be prevented and/or treated according to the 
present invention includes besides others bacterial pharyngitis, otitis media, pneumonia, bacteremia, 
meningitis, peritonitis and sepsis in humans. 

In a still further embodiment the present invention is related to a screening method using any of the 
hyperimmune serum reactive antigens or nucleic acids according to the present invention. Screening 
methods as such are known to the one skilled in the art and can be designed such that an agonist or an 
antagonist is screened. Preferably an antagonist is screened which in the present case inhibits or prevents 
the bmdmg of any hyperimmune serum reactive antigen and fragment thereof according to the present 
invention to an interaction partner. Such interaction partner can be a naturally occurring interaction 
partner or a non-naturally occurring interaction partner. 

The invention also provides a method of screening compounds to identify those, which enhance (agonist) 
or block (antagonist the function of hyperimmune serum reactive antigens and fragments thereof or 
nucleic acid molecules of the present invention, such as its interaction with a binding molecule. The 
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method of screening may involve high-throughput. 

For example, to screen for agonists or antagonists, the interaction partner of the nucleic acid molecule and 
nucleic acid, respectively, according to the present invention, maybe a synthetic reaction mix, a cellular 
compartment, such as a membrane, cell envelope or cell wall, or a preparation of any thereof may be 
prepared from a cell that expresses a molecule that binds to the hyperimmune serum reactive antigens 
and fragments thereof of the present invention. The preparation is incubated with labelled hyperimmune 
serum reactive antigens and fragments thereof in the absence or the presence of a candidate molecule, 
which may be an agonist or antagonist. The ability of the candidate molecule to bind the binding 
molecule is reflected in decreased binding of the labelled ligand. Molecules which bind gratuitously, i e 
without inducing the functional effects of the hyperimmune serum reactive antigens and fragments 
thereof, are most likely to be good antagonists. Molecules that bind well and elicit functional effects that 
are the same as or closely related to the hyperimmune serum reactive antigens and fragments thereof are 
good agonists. 

The functional effects of potential agonists and antagonists may be measured, for instance, by 
deterrrurung the activity of a reporter system following interaction of the candidate molecule with a cell 
or appropriate cell preparation, and comparing the effect with that of the hyperimmune serum reactive 
antigens and fragments thereof of the present invention or molecules that elicit the same effects as the 
hyperimmune serum reactive antigens and fragments thereof. Reporter systems that may be useful in this 
regard include but are not limited to colorimetric labelled substrate converted into product, a reporter 
gene that is responsive to changes in the functional activity of the hyperimmune serum reactive antigens 
and fragments thereof, and binding assays known in the art. 

Another example of an assay for antagonists is a competitive assay that combines the hyperimmune 
serum reactive antigens and fragments thereof of the present invention and a potential antagonist with 
membrane-bound binding molecules, recombinant binding molecules, natural substrates or ligands or 
substrate or ligand mimetics, under appropriate conditions for a competitive inhibition assay. The 
hyperimmune serum reactive antigens and fragments thereof can be labelled such as by radioactivity or a 
colorimetric compound, such that the molecule number of hyperimmune serum reactive antigens and 
fragments thereof bound to a binding molecule or converted to product can be determined accurately to 
assess the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies that bind to 
a hyperimmune serum reactive antigen and fragments thereof of the invention and thereby inhibit or 
extinguish its acitivity. Potential antagonists also may be small organic molecules, a peptide a 
polypeptide such as a closely related protein or antibody that binds to the same sites on a binding 
molecule without inducing functional activity of the hyperimmune serum reactive antigens and 
fragments thereof of the invention. 

Potential antagonists include a small molecule, which binds to and occupies the binding site of the 
hyperimmune serum reactive antigens and fragments thereof thereby preventing binding to cellular 
binding molecules, such that normal biological activity is prevented. Examples of small molecules 
include but are not limited to small organic molecules, peptides or peptide-like molecules. 

nu?nn C °™ Agonists include antisense molecules (see {Okano, H. et al., 1991}; 
^ ^^ CLEanDES AS ANT^NSE INHIBITORS OF GENE EXPRESSION; CRC Press, Boca 
Ration, FL (1988), for a description of these molecules). 

hSU^t P?^?™*? 0 ™* delude derivatives of the hyperimmune serum reactive antigens and 
tragments thereof of the invention. 
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£SET . h ' T m ° leCUleS ° r ot *e invention to interfere wi^rL^al 

C ° mPnSmg """"^ *™< Paramagnedc hS!2£ 

P ™ n ° y o£ """Stotion only and not by way of limitation of the disclosure. 
In connection with the present invention 

Figure 1 shows the characterizadon of S. pneumoniae specific human sera. 

F Zl^,£Z S Z\T° aeti2 ° a ° n " "» — ' library, LSF.-70, from St^tococcus 

Figure 3 shows the selecdon of bacterial cells by MACS using Methylated human IgGs. 
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Figure 4 shows an example for the gene distribution study with the identified antigens. 

Figure 5 shows examples of changes in epitope-spetific antibody levels in the different age groups and 
during the course of pneumococcal disease. 

Figure 6 shows examples for cell surface staining with epitope-spetific antisera by flow cytometry. 

Figure 7 shows the determination of bactericidal activity of antibodies induced by selected epitopes in an 
in vitro assay. 

Figure 8 shows the protective effect of active immunization with selected S. pneumoniae antigens in a 
murine lethality sepsis model. 

Figure 9 shows the protective effect of passive immunization with sera generated with selected S 
pneumoniae antigens in a murine lethality sepsis model. 

Figure 10 shows the identification of the protective domain within the SP2216 antigen. 

Figure 11 shows that antibodies induced by protective antigens are cross-reactive with the different S 
pneumoniae serotypes. 

Figure 12 shows the alignment of amino acid sequences of natural SP2216 variants 
Figure 13 shows the alignment of amino acid sequences of natural SP1732 variants 
Figure 14 shows the alignment of amino acid sequences of natural SP2190 variants 

Table 1 shows the summary of all screens performed with genomic S. pneumoniae libraries and human 
serum. 

Table 2 shows the summary of epitope serology analysis with human sera. 

Table 3 shows the summary of the gene distribution analysis for the identified antigens in 50 S 
pneumoniae strains. 

Table 4 shows the summary of the surface staining and bactericidal activity measurements. 

The^gures to which it might be referred to in the specification are described in the following in more 

Figure 1 shows the characterization of human sera for anti-S. pneumoniae antibodies ass measured by 
immune assays. Total anti-S. pneumoniae IgG and IgA antibody levels were measured by standard ELISA 
using total bacterial lysates or culture supernant fractions prepared from S. pneumoniae serotype 4 capsule 
negative mutant strain as coating antigens. 97 serum samples from convalescing patients with inasive 
diseases or 50 sera from healthy adults without nasopharyngeal carriage of S. pneumoniae were analysed 
at three different serum dilutions. Results of representative experiments are shown with (A) patients' sera 
wnh bacterial lysate and (B) healthy adult sera with culture superntant proteins. Data are expressed as 
Calculated from abs °rt>ance at 405nm at a serum dilution in the linear range of detection 
(10.000X for IgA 50,000 for IgG). 2x5 sera from both donor groups were selected and pooled for antigen 

ptS^T^r I f 1 T"*™* diSplay - Sel6Cted sera * cluded ™ * e two patient <FSPn3-IgG,-IgA and 
PSPn7-IgG) and two healthy pools (NSPn4-IgG,-IgA and NSPn5-IgG) are indicated by tircules. (C) 
Immunoblot analysis was performed on sera pre-selected by ELISA in order to ensure multiple immune 
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reachvity with protein antigens. Results of a representative experiment using total bacterial lvsate 

S2mE?i h m T"" 0 "^ " a ? ype 4 031,81116 negatiVe mutant ^ md selecte * patients' sera at 
5.000X dilution are showi. Not selected, low titer sera were included as negative controls Mw: molecular 

v^tn^VAS, ° f * PMmmkm S6r0type 4 ^ -tanTs'atwS 

performed by FACS to compare antibody binding to surface located antigens. Human sera were used at 

z ^aZ c T r { r o) - L epresentative data « shown — -ed r 0 i% ftai 

concentration. Signal was detected with FTTC-labeled anti-human IgGFab and analysed with the 
computer program CELLQuest. (E) Correlation between IgG titers measured by ElIa usZT toTa! 

™r y ^m^^r^s 18 expressed as mean flu °™ * — 

Figure 2 (A) shows Je fragment size distribution of the Streptococcus pneumoniae type 4 small fragment 
genomic library, LSPn-70. After sequencing 609 randomly selected /ones sequent wie «n3To 
ebnunate vector residues and the number of clones with various genomic fragment sizeTwe^otted 
70o^Z^ ° f dbtribuU ° n ° f Same S6t o^omly silenced clones of ^£ 

^Tci te ZT'TZ ^T*™* t0 TOR4 gen0me data >" <* des ^icate matching 
seou!™ in^T 1/ reCtangleS repr6Sent Mly matched dones to non-coding chromosomal 

sequences in + / + or + /- orientation. Diamonds position all clones with chimeric sequences NumSic 

donTs'I^T^ . tad,Mtod OV6X ^ drCUlar geri0me for orient ^n. Parting of v^s 
clone sets withm the library is given in numbers and percentage at the bottom of the figure. 

Figure 3 (A) shows the MACS selection with biotinylated human IgGs. The LSPn-70 library in pMAL9 1 
was screened with 10 ug biotinylated IgG (PSPn3-IgG, purified fromhuman serum) As n^ative^ 
no serum was added to the library cells for screening. Number of cells selected after the 1 st and 2 nd 

t^ZT f OWn n ^ S6le ? i0n rOUnd (UPPer l0Wer P»* respectively). (B) showTme r'clivity 
of specific clones (1-26) selected by bacterial surface display as analysed by immunoblot analy L wS Z 
human serum IgG pool (PSPn7-IgG, 4ug/ul) used for selection by MACS at a ^^ IZZ aTI 

L, S i C ° h , « 6 7J* ^ W3S 3180 " nal y ed With -tibodies directed against the pStf^m prolein 
LamB at a dilution of 1:5,000 of hyperimmune rabbit serum T B Fvh^ a™ i P iatrorm P^otern 

withoutforeignpeptideinsert. ' 3Ct fr ° m 3 d ° ne ex P ressm 8 LamB 

Sf^nt^w^ ti ° n ° f Serot yP es of S - Pneumoniae clinical isolates analysed for 

TI^TJT T fX* 7 '™ Sh ° WS ^ PCR fOT * e S-e distribution of SP1604 w «h £e 

X^n^ + ' — -A from , P _ 

Figure 5 shows the ELBA measurement of epitope-specific human serum IgG antibody levels durin* 

M-1S37, SP2190 and SP2216, as indicated. Biotin-labeled peptides were reacted with h, m L 

eampleaatZOOXandLOOOXdilutto^anddaaareexpressedasELISA™! m 

^Tw^™Sre 5A Sent 3 "- 1 ' 0 '' 7 ^ " *-» ■"*"» * S »**~ 

anti^, anti-sl^ and^,™™^^^ 
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the platform protein LamB. 

Figure 7 shows the bactericidal activity of epitope specific antibodies as determined in in vitro killing 
assay. The killing activity of immune sera is measured parallel with and calculated relative to the 
appropriate control sera. Data are expressed as percentage of killing, that is the reduction on bacterial cfu 
numbers as a consequence of the presence of antibodies in hyperimmune (HI) polyclonal mouse sera 
generated with S. pneumoniae lysate (A), in immune sera generated with SP0117 epitopes expressed in the 
LamB platform protein (B), and in mouse immune sera generated with SP1287 epitopes expressed in the 
FhuA platform protein (O. The control sera represent preimmune sera (PI), sera induced with Lamb or 
FhuA expressing E. coli clones without S. pjiewmoma-derived epitopes. S. pneumoniae serotype 4 cells were 
incubated with mouse phagocytic cells for 60 min, and surviving bacteria were quantified by counting 
cfus after plating on blood agar. 

Figure 8 shows the protection achieved by active immunization with selected S. pneumoniae antigens in a 
mouse lethality model. C3H mice (10 in each test groups) were immunized with recombinant antigens 
cloned from a serotype 4 S. pneumoniae strain and challenged with serotype 6B strain. Survival was 
monitored for 2 to 3 weeks post-challenge. A: Mice were immunized with SP0368, SP0667, SP2190 and 
SP2216 antigens and challenged intravenously with 10« cfu bacteria. Nonimmunized mice were used as 
negative controls, while PspA (SP0117) served as positive control. B: Mice were immunized with SP2190 
and SP2216 antigens and challenged intraperitonealy with 10* cfu bacteria. Mice injected with PBS or 
mock immunized with the adjuvants only (CFA/IFA) were used as negative controls, while PspA 
(SP0117) served as positive control. C: Mice were immunized with SP0498 and SP1732 antigens and 
challenged intraperitonealy with 10* cfu bacteria. Mice injected with PBS were used as negative controls 
whUe PspA (SP0117) served as positive control. 

Figure 9 shows the protection achieved by passive immunization with hyperimmune mouse sera 
generated with selected S. pneumoniae antigens in a mouse lethality model. C3H mice (10 in each test 
groups) were given mouse sera intraperitoneally 2 hrs before intraperitoneal challenge with 10* cfu S 
pneumoniae serotype 6B bacteria. Survival was monitored for 3 weeks post-challenge. 150ul immune sera 
generated with SP2190 or SP2216 were given and supplemented with 150ul serum from naive mice 
except for mice receiving lOOul each of anti-SP2190, anti-SP2216 immune sera and 100 ul serum from 
naive mice. Negative controls were treated either with 300ul sera from PBS injected, noninjected (naive) 
or nonimmune CFA/IFA injected mice. 

Figure 10 shows the identification of the protective domain within the SP2216 antigen. A: Schematic 
representation of the SP2216 antigen indicating the two subdomains predicted by in silico (structural 
prediction) analysis and the localization of epitopes identified by bacterial surface display (grey bars and 
arrows). B: C3H mice (10 in each test groups) were immunized with recombinant SP2216 antigens- full- 
length, N-terminal or C-terminal domains and challenged with S. pneumoniae serotype 6B strain given 
^FnfT^ 7 CfUS - SurViVal Was monitore d for 2 to 3 weeks post-challenge. Nonimmunized 
(CFA/IFA adjuvant injected) mice were used as negative controls, while PspA (SP0117) served as positive 
control. ' 

Figure 11 shows the cross-reactivity of antibodies by analysing different S. pneumoniae serotypes. 
Immunoblot analysis was performed with bacterial lysates prepared from 60 clinical isolates of S 
pneumoniae representing 48 different serotypes and using sera generated with SP1732, SP2190 and SP2216 
recombinant antigens cloned from a serotype 4 strain in order to test the cross-reactivity of antibodies. 
Results with seven different serotypes (Lanes 1-7) are shown as representative data taken from the 
complete analysis. Mw: molecular weight markers. 

Figure 12 shows the amino acid exchanges detected in natural SP2216 variants expressed in different 
clinical isolates of S. pneumoniae. The SP2216 gene from 47 different clinical isolates representing 47 
different S. pneumoniae serotypes were analysed by DNA sequencing. The translated amino acid 
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S ^^^r£ OVm ^ Ser0typ6S Where amino acid «*-ng« were detected relative to the 
published TTGR4 genome sequences. 

Figure 13 shows the amino acid exchanges detected in natural SP1732 variants expressed in different 
clinical isolates of S. pneumoniae. The SP1732 gene from six different clinical isolates representing six 

ZliH PMeMm ° nMe -, Ser0tyPeS (4 ' 6B ' 9V ' 18Q 19F ' ^ WCTe ^sed by DNA sentencing The 
translated amino acid sequences are shown for those serotypes where amino acid exchanges were 
detected relative to the published TIGR4 (serotype 4) genome sequences. excnanges were 

cSnltr^ 6 amin0 ^^r^ d6teCted * natUraI SP219 ° variants * different 

chmcal isolates of S. pneumoniae. The SP2190 gene from seven different clinical isolates representing seven 
major S pneumoniae serotypes (4, 6B, 9V, 14, 18 Q 19F, 23F) were analysed by DNA sequencing. The 

detected relative to the published TIGR4 (serotype 4) genome sequences. Due to missing Sequence 
information from the middle part of some of the genes, the N-teiminal and C-terminal aa aliJunen^re 
shown separately. A: N-terminal amino acid sequences; B: C-terminal amino acid sequence! 

Table 1: Immunogenic proteins identified by bacterial surface display 

A, 300bp hbraxy in fhuA with NSPn4-IgA (362), B, 300bp library in fhuA with NSPn4-IgG (832) C 300bo 
hbrary m fhuA with NSPn5-IgG (872), D, 300bp library in fhuTwith PSPn3-IgA (361)^30^^ 

N^r;™^?^-!' * ^ < 795 >' G ™>P ^rary folamB^im 

™ ' i P h * laiy ™ lamB NSPn4-IgG (929), I, 70bp library in lamB with NSPn5-IgG 

5f\5 • ^ *7 m ^ PSPn3 - I e A < 1121 >' L - ™>P hbrary in lamB with PSPn3-IgG (1242) M 
70bp library in lamB with PSPn7-IgG (514); • prediction of antigenic sequences longer than 5 ammfacids 
was performed with the program ANTIGENIC {Kolaskar, A. et al., 1990}. 

Table 2: Epitope serology with human sera. 

^T^T^Zl^^ ^ re P resentin S ^opes with individual human 

sera is shown. Extent of reactivity is pattern/grey coded; white: - (<50U), light grey: + (50-119U) dark 
gery: ^0-1991^ black: +♦+ (200-500U) and vertically crossed: Vh- (> loOl^ ELIsi unS^J) 
calculated from OD^, readings and the serum dilution after correction for background. S stands for 
score, calculated as the sum of all reactivities (addition of the number of all + ), PI to Pl^era Z 
measured to be high titer and are from patients with invasive penumococcal diseases and NT o nTo sSa 
are from healthy adults with high anti-S. pneumoniae titers. S stands for score. Which is Se sum of 
immune reaches: - =0; + =1; ++ = 2 ; +«. «3 and +«+ =4. Location of synthetic pepude^tnTtne 

tf % a T^ 40 *" gen ° me ~ Wta ° f TOR4 stra * - A - columnf frorn^d to 
indicatxng *e first and last amino acid residues, respectively. Peptide names: SP0117.1-7 present fo 

££££ T «°? 17; t^ 0408 - 1 ' POtential n ° Vel ORF ^ alt *™«ve reading-frame of Smo£ 
CRF0129.1, potenhal novel ORF on complement of SP0129. 

Table 3: Gene distribution in S. pneumoniae strains. 

fb^f St ?™ 38 ShOWn M RgUre ^ Were tested ^ PCR with oligonucleotides specific for 

*e genes encoding relevant antigens. The PCR fragment of one selected PCR fragment was sequenced fo 

l—T^Z^^l ^ COrr6Ct DNA fra ^ * — *« - anTo acid suSrSZns fo 
LmTSt H compared to S. pneumoniae TIGR4 (serotype 4). #, alternative strain used for 

sequencing, because gene was not present in the serotype 14 strain. 

Table 4: Surface location of antigenic epitopes and the functionality of the epitope-specific antibodies 
tL; r rrr nme antigGnS ^ tGSted for surface localization in the way dLcrLd and Tpr^ented ^ 
enSdwiT T? m ° USe 5613 8enerated * ^^«on with £. coli clones harboring pTasmid" 

column labeled FACS. The very same immune reagents were used in an in vitro killing assay, as 
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shown in Figure 7 for the examples, and presented for all antigens tested positive by FACS in column PK 
(phagocytic killing). -: negative result, +: not consistently positive in all assays performed, ++ and +++ are 
consistently positive relative to control reagents. 

EXAMPLES 

Example Is Characterization and selection of human sera based anti-S. pneumoniae antibodies, 
preparation of antibody screening reagents 

Experimental procedures 

Enzyme linked immune assay (ELISA). 

ELBA plates (Maxisorb, Millipore) were coated with 5-10 ug/ml total protein diluted in coating buffer 
(0 1M sodium carbonate pH 92). Three dilutions of sera (2,000X, 10,000X, 50,000X) were made in PBS- 
^Shty s P edfic Horse Peroxidase (HRP)-conjugated anti-human IgG or anti-human IgA 

secondary antibodies (Southern Biotech) were used according to the manufacturers' recommendations 
(dilution: 1 OOOx). Antigen-antibody complexes were quantified by measuring the conversion of the 
} *° P roduct based on OD«- readings by automatic EUAS reader (TECAN 

Preparation of bacterial antigen extracts 

Total bacterial lysate: Bacteria were grown overnight in THB (Todd-Hewitt Broth) and lysed by repeated 
freeze-thaw cycles: incubation on dry ice/ethanol-mixture until frozen (1 min), then thawed at 37<>C (5 
mm): repeated 3 times. This was followed by sonication and collection of supernatant by centrifugation 
(3,500 rpm, 15 nun, 4°C). ° 

Culture supernatant: After removal of bacteria by centrifugation, the supernatant of overnight grown 
bacterial cultures was precipitated with ice-cold ethanol by mixing 1 part supernatant with 3 parts abs 
ethanol and mcubated overnight at -20°C. Precipitates were collected by centrifugation (2,600 g, for 15 
rmn).Dry pellets were dissolved either in PBS for ELISA, or in urea and SDS-sample buffer for SDS- 
PAGE and immunoblotting. The protein concentration of samples was determined by Bradford assay. 

Immunoblotting 

Total bacterial lysate and culture supernatant samples were prepared from in vitro grown S. pneumoniae 
serotype 4 uncapsulated mutant strain 10 to 25ug total protein/lane was separated by SDS-PAGE using 
the BioRad Mini-Protean 3 Cell electrophoresis system and proteins transferred to nitroceUulose 
' A f^ sham Pfca^acia). After overnight blocking in 5% milk, human sera were added at 
2,000x dilution, and HRPO labeled anti-human IgG was used for detection. 

Surface staining of bacteria 

Flow cytometric analysis was carried out as follows. S. pneumoniae serotype 4 uncapsulated mutant strain 
was grown in Todd-Hewitt broth overnight until early stationary phase. Cells were collected and washed 

™ - m ^ S JSf ^ ^ S ° 1Uti0n (HBSS ) md *• ceU density was adjusted to approximately 1 X 10* 
CFU m lOOul HBSS with 0.5% BSA based on OD600 run readings. After incubation with human sera at 
0,5 and 2 A hnal concentration for 60 min at 4»C, unbound antibodies were washed away by 

h!,? K^^wf °- 5% BSA - F ° r det6Cti0n fluo ^cein (FTTC) labeled secondary goat anti- 

human IgG (F(ab ) 2 fragment specific) was incubated with the cells at 4°C for 30 min. After washing the 

B^nn S 2% Paraf0rmaldeh y de - Surface staining antibodies were detected usmg a 

CEt L ^esf nS ° n Cyt ° meter ^ m * yZed With COmputer P«*«* 

seS°^Jr m >?*^ g 7 0miC Rve Seta from both * e patient «* *e healthy group were 

selected based on the overall anti-streptococcal titers for a serum pool used in the screening procedure. 
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Antibodies against E. coli proteins were removed by incubating the heat-inactivated sera with whole cell 
E. coli cells (DH5alpha, transformed with pHIEll, grown under the same condition as used for bacterial 
surface display). Highly enriched preparations of IgGs from the pooled, depleted sera were generated by 
protein G affinity chromatography, according to the manufacturer's instructions (UltraLink Immobilized 
Protein G, Pierce). IgA antibodies were purified also by affinity chromatography using biotin-labeled 
anti-human IgA (Southern Biotech) immobilized on Streptavidin-agarose (GIBCO BRL). The efficiency of 
depletion and purification was checked by SDS-PAGE, Western blotting, ELISA and protein 
concentration measurements. **" r 



Results 



The antibodies produced against S. pneumoniae by the human immune system and present in human sera 
are indicative of the in vivo expression of the antigenic proteins and their immunogenidty. These 
molecules are essential for the identification of individual antigens in the approach as described in the 
present invention, which is based on the interaction of the specific anti-streptococcal antibodies and the 
corresponding S. pneumoniae peptides or proteins. To gain access to relevant antibody repertoires, human 
sera were collected from 

I. convalescent patients with invasive S. pneumoniae infections, such as pneumonia, bacteraemia 
and meningitis. (S. pneumoniae was shown to be the causative agent by medical microbiological tests), 

H healthy adults without carriage at the time of sampling. S. pneumoniae colonization and 
infections are common, and antibodies are present as a consequence of natural immunization from 
previous encounters. 

97 serum samples from patien and 50 sera from healthy adults were characterized for anti-S. pneumoniae 
antibodies by a series of immune assays. Primary characterization was done by ELISA using two 
different antigen preparations, such as total bacterial extract and culture supernatant proteins prepared 
from S. pneumoniae serotype 4 uncapsulated mutant strain. It is an important aspect that we analysed 
uncapsulated strain, since we avoided the reactivities coming from serotype specific abundant anti- 
capsular polysaccharide antibodies. 

Recently it was reported that not only IgG, but also IgA serum antibodies can be recognized by the FcRDI 
receptors of PMNs and promote opsonization {Phillips-Quagliata, J. et al„ 2000}; {Shibuya, A. et al„ 2000}. 
The primary role of IgA antibodies is neutralization, mainly at the mucosal surface. The level of serum 
IgA reflects the quality, quantity and specificity of the dimeric secretory IgA. For that reason the serum 
collection was not only analyzed for anti-streptococcal IgG, but also for IgA levels. In the ELBA assays 
highly specific secondary reagents were used to detect antibodies from the high affinity types, such as 
IgG and IgA, but avoided IgM. Production of IgM antibodies occurs during the primary adaptive 
humoral response, and results in low affinity antibodies, while IgG and IgA antibodies had already 
undergone affinity maturation, and are more valuable in fighting or preventing disease. Antibody titers 
were compared at given dilutions where the response was linear (Fig. 1A and IB.). Sera were ranked 
based on the IgG and IgA reactivity against the two complex antigenic mixtures, and the highest ones 
were selected for further testing by immunoblotting. This analysis confirmed a high antibody reactivity 
of the pre-selected sera against multiple pneumococcal proteins, especially when compared to not 
selected, low-titer sera (Fig 1C). ELISA ranking of sera also correlated very well with surface staining of 
the same S. pneumoniae strain (Fig. ID and IE) suggesting that the majority of the antibodies detected by 
ELISA corresponded to surface antigens. This extensive antibody characterization approach has led to the 
unambiguous identification of anti-pneumococcal hyperimmune sera. 

Selected sera, 2x5 from both the patient and healthy donor groups were pooled to further enrich for 
abundant antibodies, but still having a representation of antibody repertoires- of different individuals 
IgG and IgA antibodies were purified from pooled sera by affinity chromatography and depleted of E. 
coh -reactive antibodies to avoid background in the bacterial surface display screen 
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Example 2: Generation of highly random, frame-selected, small-fragment, genomic DNA libraries of 
Streptococcus pneumoniae 

Experimental procedures 

Preparation of streptococcal genomic DNA 50 ml Todd-Hewitt Broth medium was inoculated with S 
pneumoniae serotype 4 (clinical isolate, typed with conventional serotyping) bacteria from a frozen stab 

TfnE OV T T c aerati0n ^ Shakmg fOT 18 h at 37 ° C - ^ was harvested, centrifuged with 

l,6UUx g for 15 mm and the supernatant was removed. Bacterial pellets were washed 3 x with PBS and 
carefully re-suspended in 0.5 ml of Lysozyme solution (100 mg/ml). 0.1 ml of 10 mg/ml heat treated 
RNase A and 20 U of RNase Tl were added, mixed carefully and the solution was incubated for 1 h at 
37 G Following the addition of 0.2 ml of 20 % SDS solution and 0.1 ml of Proteinase K (10 mg/ml) the 
tube was incubated overnight at 55°C. 1/3 volume of saturated NaCl was then added and the solution 
was incubated for 20 min at 4°C. The extract was pelleted in a mkrofuge (13,000 rpm) and the 
SU . P i er ^ a T a ^, t * ansferred mto a new ^e. The solution was extracted with PhOH/CHCtylAA (2524-1) and 
with CHCb/IAA (24:1). DNA was precipitated at room temperature by adding 0.6x volume of 
Isopropanol spooled from the solution with a sterile Pasteur pipette and transferred into tubes 
containing 80 /„ ice-coW ethanol. DNA was recovered by centrifuging the precipitates with 10-12,000x g, 
then dried on air and dissolved in ddHjO. ^ 

Preparation of small genomic DNA fragments. Genomic DNA fragments were mechanically sheared into 
fragments ranging in size between 150 and 300 bp using a cup-horn sonicator (Bandelin Sonoplus UV 
2200 somcator ecmipped with a BBS cup horn, 10 sec. pulses at 100 % power output) or into fragments of 
size between 50 and 70 bp by mild DNase I treatment (Novagen). It was observed that sonicaticTyielded 
a much tighter fragment size distribution when breaking the DNA into fragments of the 150-300 bp size 
range. However, despite extensive exposure of the DNA to ultrasonic wave-induced hydromechanical 
sheanng force, subsequent decrease in fragment size could not be efficiently and reproducibly achieved 
Therefore fragments of 50 to 70 bp in size were obtained by mild DNase I treatment using Novagen's 
shotgun cleavage kit. A 1:20 dilution of DNase I provided with the kit was prepared and L digestion 
was performed in the presence of MnCb. in a 60 ul volume at 20°C for 5 min to ensure double-stranded 
cleavage by the enzyme. Reactions were stopped with 2 pi of 0.5 M EDTA and the fragmentation 
efficiency was evaluated on a 2% TAE-agarose gel. This treatment resulted in total fragmentation of 
genomic DNA into near 50-70 bp fragments. Fragments were then blunt-ended twice using T4 DNA 
Polymerase m the presence of 100 uM each of dNTPs to ensure efficient flushing of the ends. Fragments 
were used immediately m ligation reactions or frozen at -20°C for subsequent use. 

Desmpton of the vectors. The vector pMAL4.31 was constructed on a pASK-IBA backbone {Skerra, A., 
1994} with the beta-lactamase (bla) gene exchanged with the Kanamydn resistance gene. In addition the 

%JZ*T£"* *" d0ninS ^ ™ 6 S6qUenCe enCOdin S mah " e ^ta-lactamase is 

preceded by the leader peptide sequence of ompA to allow efficient secretion across the cytoplasmic 

irr^ 6 ; ^ he " nOTe a ^nce encoding the first 12 amino acids (spacer sequence) of mature beta- 
actamase follows tiie ompA leader peptide sequence to avoid fusion of sequences immediately after the 
T"? P e P«dase ^ ava ite/ since e g clusters of positive amin<> ^ fa ^ ^ 

n^TT or . f abohsh translocation across the cytoplasmic membrane {Kajava, A. et al., 2000}. A Smal 
r^tnchon site serves for library insertion. An upstream Fsel site and a downstream NotI site, which were 
used for recovery of the selected fragment, flank the Smal site. The three restriction sites are inserted after 
Ae sequence encoding the 12 amino acid spacer sequence in such a way that the bla gene is transcribed Z 

so'tntTb^T 1 : * 3 St ° P COd ° n 15 bp aft6r me N0tI Site - A +1 b P ^ores the Ma 

S ° *"* beta - lactamase P rote * ^ produced with a consequent gain of Ampicillin resistance. 

I"*"* ft^'K™ COnstructed b y donin S *e lamB gene into the multiple cloning site of pEHl 
{Hashemzadeh-Bonehi, L. et al., 1998}. Subsequently, a sequence was inserted m lam^tH^l 
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154, containing the restriction sites Fsel, Smal and NotL The reading frame for this insertion was 
constructed in such a way that transfer of frame-selected DNA fragments excised by digestion with Fsel 
and NotI from plasmid pMAL4.31 yields a continuous reading frame of lamB and the respective insert. 

The vector pMALlO.l was constructed by cloning the btuB gene into the multiple cloning site of pEHl. 
Subsequently, a sequence was inserted in btuB after amino acid 236, containing the restriction sites Fsel, 
Xbal and Noil. The reading frame for this insertion was chosen in a way that transfer of frame-selected 
DNA fragments excised by digestion with Fsel and NotI from plasmid pMAL4.31 yields a continuous 
reading frame of btuB and the respective insert. 

The vector pHIEll was constructed by cloning HxefliuA gene into the multiple cloning site of pEHl 
Thereafter, a sequence was inserted in fhuA after amino acid 405, containing the restriction site Fsel, Xbal 
and NotI. The reading frame for this insertion was chosen in a way that transfer of frame-selected DNA 
fragments excised by digestion with Fsel and NotI from plasmid pMAL4.31 yields a continuous reading 
frame of fhuA and the respective insert 

Cloning and evaluation of the library for frame selection. Genomic S. pneumoniae DNA fragments were Iigated 
into the Smal site of the vector pMAL4.31. Recombinant DNA was electroporated into DH10B 
electrocompetent E. coli cells (GIBCO BRL) and transformants plated on LB-agar supplemented with 
Kanamycin (50 ug/ml) and Ampicillin (50 ug/ml). Plates were incubated over night at 37°C and colonies 
collected for large scale DNA extraction. A representative plate was stored and saved for collecting 
colonies for colony PCR analysis and large-scale sequencing. A simple colony PCR assay was used to 
initially determine the rough fragment size distribution as well as insertion efficiency. From sequencing 
data the precise fragment size was evaluated, junction intactness at the insertion site as well as the frame 
selection accuracy (3n+l rule). 

Cloning and evaluation of the library for bacterial surface display. Genomic DNA fragments were excised from 
the P MAL4.31 vector, containing the S. pneumoniae library with the restriction enzymes Fsel and NotI The 
entire population of fragments was then transferred into plasmids pMAL9.1 (LamB) or pHIEll (FhuA) 
which have been digested with Fsel and NotI. Using these two restriction enzymes, which recognise an 8 
bp GC rich sequence, the reading frame that was selected in the pMAL4.31 vector is maintained in each of 
the platform vectors. The plasmid library was then transformed into E. coli DH5alpha cells by 
electroporation. Cells were plated onto large LB-agar plates supplemented with 50 ug/ml Kanamycin and 
grown over night at 37°C at a density yielding clearly visible single colonies. Cells were then scraped off 
Ae surface of these plates, washed with fresh LB medium and stored in aliquots for library screening at - 

Results 

Libraries for frame selection. Two libraries (LSPn70 and LSPn300) were generated in the pMAL4.31 vector 
with sizes of approximately 70 and 300 bp, respectively. For each library, ligation and subsequent 
transformation of approximately 1 ug of pMAL4.31 plasmid DNA and 50 ng of fragmented genomic S 
pneumoniae DNA yielded 4x 10* to 2x 10* clones after frame selection. To assess the randomness of the 
libraries, approximately 600 randomly chosen clones of LSPn70 were sequenced. The bioinformatic 
analysis showed that of these clones only very few were present more than once. Furthermore, it was 
shown that 90/o of the clones fell in the size range between 25 and 100 bp with an average size of 52 bp 
seized ^ sequences followed the 3n+l rule, showing that all clones were properly frame 

Bacterial surface display libraries. The display of peptides on the surface of E. coli required the transfer of the 
TlTo'fT ^ libraries from the frame selection vector pMAL4.31 to the display plasmids 
PMAL9.1 (LamB) or pHIEll (FhuA). Genomic DNA fragments were excised by Fsel and No I lirtrtZ 
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and ligation of 5ng inserts with O.lug plasmid DNA and subsequent transformation into DHSalpha cells 
resulted in 2-5x IV clones. Hie clones were scraped off the LB plates and frozen without further 
amplification. 

Example 3: Identification of highly immunogenic peptide sequences from S. pneumoniae using 
bacterial surface displayed genomic libraries and human serum 

Experimental procedures 

MACS screening. Approximately 2.5x W cells from a given library were grown in 5 ml LB-medium 
SU ffJ^ ted 50 K8/ml i°* 2 h at 37°C. Expression was induced by the addition of 1 

mM IPTG for 30 min. CeUs were washed twice with fresh LB medium and approximately 2x 10' cells re- 
suspended m 100 ul LB medium and transferred to an Eppendorf tube. 

10 ug of biotinylated, human IgGs purified from serum was added to the cells and the suspension 
incubated overnight at 4°C with gentle shaking. 900 ul of LB medium was added, the suspension mixed 
and subsequently centrifuged for 10 min at 6,000 rpm at 4°C (For IgA screens, 10 ug of purified IgAs 
were used and these captured with biotinylated anti-human-IgG secondary antibodies). Cells were 
washed once with 1 ml LB and then re-suspended in 100 ul LB medium. 10 ul of MACS microbeads 
C °^i e t!° stre P tavidin (Miltenyi Biotech, Germany) were added and the incubation continued for 20 min 
at 4 C Thereafter 900 ul of LB medium was added and the MACS microbead cell suspension was loaded 
onto the equilibrated MS column (Miltenyi Biotech, Germany) which was fixed to the magnet. (The MS 
columns were equilibrated by washing once with 1 ml 70% EtOH and twice with 2 ml LB medium.) 

The column was then washed three times with 3 ml LB medium. After removal of the magnet, cells were 
e uted by washing with 2 ml LB medium. After washing the column with 3 ml LB medium, the 2 ml 
eluate was loaded a second time on the same column and the washing and elution process repeated. The 
loading, washing and elution process was performed a third time, resulting in a final eluate of 2 ml. 

A second round of screening was performed as follows. The cells from the final eluate were collected by 
centrifugation and re-suspended in 1 ml LB medium supplemented with 50 ug/ml Kanamycin. The 
culture was incubated at 37°C for 90 min and then induced with 1 mM IPTG for 30 min Cells were 
subsequently collected, washed once with 1 ml LB medium and suspended in 10 ul LB medium. 10 ug of 
human, biotinylated IgGs were added again and the suspension incubated over night at 4°C with gentle 
shaking. All further steps were exactly the same as in the first selection round. Cells selected after two 
ovrnlght^ S^c" ^ ***** ° t *° ***** SU PP lemented 50 Kanamycin and grown 

Evaluation of selected clones by sequencing and Western blot analysis. Selected clones were grown overnight at 
37 C m 3 ml LB medium supplemented with 50 ug/ml Kanamycin to prepare plasmid DNA using 
standard procedures. Sequencing was performed at MWG (Germany) or in collaboration with TTGR 

For Western blot analysis approximately 10 to 20 ug of total cellular protein was separated by 10% SDS- 
PAGE and blotted onto HybondC membrane (Amersham Pharmacia Biotech, England). The LamB or 
fusion , P rotems were detected ^ing human serum as the primary antibody at a dilution of 
approximately 1:5 000 and anti-human IgG or IgA antibodies coupled to HEP at a dilution of 1:5,000 as 
SS^J T ^ I"," W3S P erformed usi "8 E( =L detection kit (Amersham Pharmacia 

^pH « ^ } ' ^ matlvel y' rabbit anti-FhuA or rabbit anti-LamB polyclonal immune sera were 

ZlTJ^ 11113 ?^ m combination with respective secondary antibodies coupled to HRP for 
tne detection of the fusion proteins. 
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Results 



Screening of bacterial surface display libraries by magnetic activated cell sorting (MACS) using biotinylated Igs 
The libraries LSPn70 in pMAL9.1 and LSPnSOO in pHffill were screened with pools of biotinylatJ? 

"S 8 ?- ^ §AS fr ° m Patient S6ra ° r 8613 from health y individuals (see Example 1: Preparation of 
antibodies from human serum). The selection procedure was performed as described under Experimental 
procedures. Figure 3A shows a representative example of a screen with the LSPn-70 library and PSPn3- 
IgGs^As can be seen from the colony count after the first selection cycle from MACS screening, the total 
wT^ °^ CeUs ,^ COVered i f * *« end * drastically reduced from 2x10? cells to approximately 5x 10* cells, 
whereas the selection without antibodies added showed a reduction to about 2x103 cells (Figure 3A) 
After the second round, a similar number of cells was recovered with PSPn3-IgGs, while fewer than 10 
cells were recovered when no IgGs from human serum were added, clearly showing that selection was 
dr ^ .TT,^ antibodies. To evaluate the performance of the screen, 26 selected 
mZJIk 5? Md T y S u bj6Cted t0 immUn ° bl0t "dy* ™* screening IgG pool (PSPnT) 
2sTt in * i T ieVe&le t *"* ~ 9 ° % ° f * e S6leCted d ° nes shoWed activity wim antibodies 
™ZT * nf V ? SerUm WhGreaS ^ C ° ntro1 Strain «P»*W LamB without a S. pneumoniae 
specific insert did not react with the same serum. In general, the rate of reactivity was observed to lie 

r e XSe3lge t0 90% - COl ° ny ShOW6d ^ ^ S6leCted ^-contained an insert * 

fdS^ ° £ 3 la Fu nUmbCr ° f rand0mly Picked dones < 600 to 1200 P er screen) led to the 

^.7" * * e S 6 " 6 and *» corresponding peptide or protein sequence that was specifically 
recognized by the human serum antibodies used for screening. The frequency with which a specific clone 
is selected reflects at least in part the abundance and/or affinity of the specific antibodies in me serum 

dot s L S ; le ?T ^ ieC °?if S epit ° pe Pre8ented by **» do - * *at regard it is shSngXt 
clones derived from some ORFs (e.g. SP2216, SP0117, SP0641, SP2136, SP2190, Ip0107, SP0082) were 
picked more than 100 times, indicating their highly immunogenic property. Table 1 summarizes tiie dSa 
" * ] 2 perfonned screens - AU clones that are presented* Table 1 have been verified by 
w^*r»o°Ir^ yS1S USmg Wh ° le i CeUular from single clones to show the indicated reactivity 

Z *i /^i°l h ^T S6rUm US6d * ^ r6SpeCtive SCreen - As can be ^en from Table 1, distinct region 
of the identified ORF are identified as immunogenic, since variably sized fragments of the protein! Ze 
displayed on the surface by the platform proteins. proteins are 

It is further worth noticing that most of the genes identified by the bacterial surface display screen encode 

ST ^ ^ t0 *: SUrf3Ce ° f * " ^ ^eted. This I ZTcoT^cl 

with the expected role of surface attached or secreted proteins in virulence of S. pneumoniae. 

huTan'ser^ 886881116 " 1 " "~ MM * * b™™*"* peptide sequences with individual 

Experimental procedures 
Peptide synfliesis 

ItT^™* Syi f? i2ed * 8111311 80316 < 4 m S «P to 288 in parallel) using standard F-moc 

cheimstry on a Rink amide resin (PepChem, Tubingen, Germany) using a SyroTsynthes^ 
(Multisyntech, Witten, Germany). After the sequence was assembled peptides werrelonSS^ 
F^-epsdon-ammohexanoic acid (as a linker) and biotin (Sigma, St ^1*mo7*IZ£ uTa no™* 
amino acid). Pephdes were cleaved off the resin with 93%TFA, 5% triethylsilane and 2% water for ™ 

presence of the correct mass was verified by mass spectrometry on a Reflex m MALDI-TOF (Bruker! 
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Bremen Germany). The peptides were used without further purification. 
Enzyme linked immune assay (EUSA). 

Biotin-labeled peptides (at the N-terminus) were coated on Streptavidin ELISA plates (EXICON) at 10 
pg/ml concentration according to the manufacturer's instructions. Highly specific Horse Radish 
Peroxidase (HRP)-conjugated anti-human IgG secondary antibodies (Southern Biotech) were used 
according to the manufacturers' recommendations (dilution: l,000x). Sera were tested at two serum 
dilutions, 200X and 1,0005c Following manual coating, peptide plates were processed and analyzed by 
the Gemini 160 ELISA robot (TECAN) with a built-in ELISA reader (GENIOS, TECAN). 

Approximately 110 patients and 60 healthy adult sera were included in the analysis. Following the 
bioinformatic analysis of selected clones, corresponding peptides were designed and synthesized. In case 
of epitopes with more than 26 amino acid residues, overlapping peptides were made. All peptides were 
synthesized with a N-terminal biotin-tag and used as coating reagents on Streptavidin-coated ELISA 

pldXSS* 

The analysis was performed in two steps. First, peptides were selected based on their reactivity with the 
individual sera, which were included in the serum pools used for preparations of IgG and IgA screening 
reagents for bacterial surface display. A summary for serum reactivity of 224 peptides representing S 
pneumoniae epitopes from the genomic screen analysed with 20 human sera (representing 4 different 
pools of five sera) used for the antigen identification is shown in Table 2. The peptides were compared by 
tfie score calculated for each peptide based on the number of positive sera and tine extent of reactivity. 
Peptides range from highly and widely reactive to weakly positive ones. Among the most reactive ones 
there are known antigens, some of them are also protective in animal challenge models for 
^^S??^ ' Carria S e or ^P 8 * 8 < e 'S- PspA/SP0U7, serine protease/SP0641, histidine triad 
protein/SP1175). Peptides not displaying a positive reaction were not included in further, more detailed 
studies. 

Second, a large number of not pre-selected individual sera from patients with invasive pneumococcal 
disease or from healthy adults and children were tested against the peptides showing specific and high 
reactivity with the screening sera. Seroconversion during disease was tested for highly positive peptides 
by using three serial serum samples collected longitudinally from patients with invasive pneumococcal 

!TT' J 6 , I diSeaSe ° CCUIred (pre) ' 860011(1 1x1 acute P hase (with* 5 days after onset) 
and the third m the convalescent phase (> 3 weeks after onset) of the disease. Two representative ELISA 
experiments are shown with two different patients, displaying seroconversion to multiple peptides, 
suggesting that epitope-spedfic antibody levels were low before disease occurred, and were induced in 
the acute and convalescent phase (Fig. 5). The antigens showing this antibody profile are especially 
valuable for vaccine development (e.g. SP2216, SP2109, SP1175, SP0117, SP0082) 



Example 5: Gene distribution studies with highly immunogenic proteins identified from S. 



pneumoniae. 
Experimental procedures 



Gene distribution of pneumococcal antigens by PCR. An ideal vaccine antigen would be an antigen that is 

HZTl m fl?^ J*? maj ° rity ° f Strains ° f ^ organism to which the vaccine is directed. In 
order to establish whether the genes encoding the identified Streptococcus pneumoniae antigens occur 
ub quxtousty in S. pneumoniae strains, PGR was performed on a series of independent S. pneumoniae 
settLT P rr S ^f* f ° r * e ° f * ^moniae were Obtained covering Z 

ZnZ S s Pr6Sent * patientS 38 ShoWn * Fi ^ e ^ Oligonucleotide sequent as 

primers were designed for all identified ORFs yielding products of approximately 1,000 bpjf posJbte 
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^!f^:t^r^TZ es - G r^ c DNA of s - — - - p-P-d as 

(1U), 200 nM d^ P i 0 pMo?o7^H T IT*"? V ° W 25 ^ ^ T ^ Phrase 
instructions (Invitrogi. TheN^^ 0 ^deobde and the kit according to the manufacturers 

56°C, 30sec. 72^ 72°a ^ ^ ^ . 3 ° ^ ^ 95 ° C ' 3Q * 30sec - 95 ° C < 

primer pairs. Q P erform ^' ^s conditions had to be adapted for individual 

Results 

indicted 50 strain, Z^^LTZ?™? ' fiSU ?. 4B „ h °"* «» fa SP1604 with all 

a type 14 stram was ^^i^T-T 

genea (SP0667, SP0930) ahowed variation in size and 
in all tested ateaina. S^n^l^SE^T^ ^J* *« — «—*—— 
oomparieon to the type /attain conftanS STZJ^STlS^^^d*^ 
degree of sequence divergence a* inHWo/i ;~ t ui o t correct UNA fragment and revealed a 

Experimental procedures 
Generation of immune sera from mice 
~-mTd^~ 

the OD™ measurement/^ 

followed by a boost after 2 wJTse^^ 1 weTafTe ^ ^ Lv - 

antibody levels were measured by peptide E^S^T SeCOnd n9ecttaL E P ito P e s P ecific 

In vitro expression of antigens 

lysates and bacterial culture supernatant* F™I * 6 P resence of antigens in total 

Ce/Z surface staining 

twica in Hanks Balan^^u^ ^"i^ ^ lmmu "° bl ° t Cells were washed 

CPU in 100M1 HBSS, 0.5% St^SS^Tj^Z^ * 1 X 1» 

100-fold, unbound antibodies were washed aw^vb»7^ mOUSe dUuled 50 to 
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with the computer program CELLQuest. Negative control sera included mouse pre-immune serum and 
mouse polyclonal serum generated with lysates prepared from IPTG induced E. coU cells transformed 
with plasmids encoding the genes lamB oxfliuA without S. pneumoniae genomic insert. 

Bactericidal (kilWig) assay 

Murine macrophage cells (RAW246.7 or P388.D1) and bacteria were incubated and the loss of viable 

Bala^Lt^? T m W ^lT ined by COl ° ny COUntin ^ M brief ' bacteria were cashed twice in Hanks 

"* CeU dGnSity W3S ad i USted to approximately IX 10* CFU in 50ul 
^ a f vo, ^nn 7T T d m ° USe ^ (UP t0 25%) *«* S" mea Pfc complement (up to 5%) in a 
^Z ri f Z Q 6 ^T ^ 4 ° C Pre -°P sonized ^cteria were mixed with macrophages (murine 
ceU line RAW264.7 or P388.D1; 2X 10< cells per lOOul) at a 150 ratio and were incubated af 37°cTa 
rotating shaker at 500 rpm. An aliquot of each sample was diluted in sterile water and incubld for 5 
nun at room temperature to lyse macrophages. Serial dilutions were then plated onto Todd-Hewitt Broth 
agar plates The > plates were incubated overnight at 37°C, and the colonies were counted w"th I 
^Z D M C ° r COUnt6r ^ rnStruments >- ^ -« Eluded mouse pre-u^nT^m 

^rZiH? t T 8en T ated ^ lySatCS Piepared from ^ G ****** «* transformed 
with plasmids harbormg the genes Z«mB or/fcuA without S. pne^niae genomic insert. 



Results 



In vitro expression of antigens. The expression of the antigenic proteins was analyzed in vitro in S 
pnsumomae serotype 4 by using sera raised against E. coli clones harboring plasmids encoding the 
platform protein fused to a S. pneumoniae peptide. First, the presence of specific anXodies ™s 
l^Tt 7 Peptide i ! LISA i—unoblotting using the £. L clone expressing the given epitope 

embedded in LamB or FhuA platform proteins. Positive sera were then analysed by inununblotnng ustl 
Ti^T 1 ly T eS ^ ^ SUpernatan * P-P-d from S. ^— i sero^e 4 strain^ "not 

^T w ^ T ^ 38 3 &St ^ t0 d6tennine Whether a protein is ^sed at all, and" 
under which growth conditions, in order to evaluate surface expression of the polypeptide by FACS 

ZZZEZ antidPated baS6d ° n UteratUre ^ "* ^ " e^rLse^^ 

Cell surface staining of S. pneumoniae. Cell surface accessibility for several antigenic proteins was 
subsequently demonstrated by an assay based on flow cytometry. Streptococci w^ incublZ 

SZTITS P l Cl ° n f 1 J IlOUSe « a P«— lysfte or E. *» hSolg 

plasmids encoding the platform protein fused to a S. pn^monwe peptide, follow by detection wtth 

lysate contains anybodies against surface components, demonstrated by a significant shift in fluorescence 

cliu l P T m ° n ? 4 ^ population - SimiI ar cell surface staininTof S. pneumoniae sert^ 4 

^Sff^^abl 7 4 ^ S6ra raiS6d agajnSt ° f ^ ° £ * e Pneumococcal aSen! 

moStd hv £ H f t ! ^ S ° me ta8tmde * a ^population of the bacteria was not stainel, as 
SSSSfc T ? ^ ^ ^ *» *B). This phenomenon may be a resul of 

levSToarSir^H ? ^T^* ^ ^ of * e bacte ^ insuffident antibody 

levels or partial inhibition of antibody binding caused by other surface molecules or plasma proteins. 

\ aCt f C f a . Opsonophagocytic killing is the cornerstone of host defense against 

exbacellular bacteria such as S. pneumoniae. Cell surface binding of antibodies to bacterial ^tigeS^e 
^cvfes^ ^ (bactericidal) by phagocytic cells (macrophages and neu^pM 

^ *■ PartiCUlar « abated complement 

components C3bi). It has been shown that anti-pneumococcal bactericidal activity of human sera 

~i zv%srz r be r eIated ^ in r protection of vacdnated 

H^'Zr « . ^ Flgure 7 ^^P 1 ^ are shown and in Table 4 a summary is presented on 
bactencidal activity measured by antigen-specific antibodies generated in oT^m ^pon^ 
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epitopes. According to these data, several of the novel pneumococcal antigens induce functional 
anybodies (eg. SP0082, SP2216, SP2136, SP0454, SP0069, SP0369, etc.). Importantly a w^o^ 
protective pneumoruae antigen, PspA (SP0117) is proved to be strongly positive fa the very same assay 

These experiments confirmed the biofaf ormatic prediction that many of the proteins are exported due to 

%Z£ES££ S 4 q r Ce ^ * . addM ° n Sh ° Wed ** *^ ™ I"— ° n * 6 ceTsurfTce of S 
^hnH T £T i" ^ alS ° COnfirm * at 111686 P r ° teins are avail *le for recognition by human 
antibodies wi* functional properties and make them valuable candidates for the ^velopmentTa 
vaccine against pneumococcal diseases. F or a 

Example 7: Identification of pneumococcal antigens inducing protective immune responses 

Experimental -procedures 

Expression of recombinant pneumococcal proteins 

DlT^f ? BWWW/n ? A ^ gene/DNA fragment of interest was amplified from the genomic 

° f S J neUm0n f ae < strain T4 ' Capsular type 4) by PGR using gene specific primers. Apart ITmZl 
gene specific part, the primers had restriction sites that aided fa a direcLnal donfag of mTamphfied 

Pr ° dU ^ ^! ^ aimealinS (SpedfiC) Part ° f primer ra "8 ed between 15-24 bases fa len^ ^ e 
v^r^OVAr^lr 616 ^ 8 ^ With 1116 aPPr ° Priate reStriCti0n en2 3- e - d into plr28b^) 

^^^mSSTf 1 "* f " C ° nfirmed t0 *« >« of E coll 

oitiLS Jfi (INVTTROGEN) that served as expression hosts were transformed. These cells were 
S express the gene of interest. Expression „nH p,, ^^ nf r ^„ c . P ^ gJJ 

f a ? n ^ n * 6 r -o-binant plasmid was grown until log phase fa a required culture volume 
Once the OEW™ of 0.8 was reached the culture was induced with 1 mM EPTG for 3 hours at 37°C^h e 
cells were harvested by centrifugation, lysed by a combination of freeze-thaw method followed bv 
disruption of cells with 'Bug-buster®, NOVAGEN'. The lysate was separated by SSfaStaT fato 
o^ncaHrT^ ^ 7 „ 1UWe (P6llet) Dep6ndin S on the location 

o^ffa ^ S h ^ k T l° U ° Wed - ^ ^ Pr ° tein WaS * * e **** purification of£ 

qS^^T* 7 tmding ^ e abOV6 S -P^atant with Ni-Agarose beads (iSi-NTA-Agaros^ 
V £ *e Presence of the penta Histidine (HIS) at the C or N or both termini of the expressed 
bv bX £ " agarOSe WbJIe 0th6r COntami -^ P-tefas were washed from thTSto 

^avS bv^d^H T ! ^ 6lUted ^ 100 ^ immida2 ° le - d eluate was concentrated, 
assayed by Bradford for protein concentration and checked by PAGE and Western blot. In case the 
pro em was present m the insoluble fraction the pellet was solubilized in buffer containing 8 MUrea The 
punfication was done under denaturing conditions (fa buffer containing 8M U^usL^Tsa^e 
materials and procedure as mentioned above. The eluate was concentrated and dialyzed to remove alTSe 

^z^:iT^ od m ™ e proteins were *— * sds - pag « - — — e 

Animal protection studies 

nmSaSn 1 STS * V' * ^ ^ ^ — W ~ 6 — - Active 

ZX p , A, Hg recombinant proteins were injected subcutaneously and adjuvanted^th 
Complete Freud Adjuvant (CPA). Animals were boosted twice with the same amount of proSn^ur 

ZAW0lTJ£ Om f ete Fr6Und AdjWant ™ 3t ^ 14 "« 28 ' A well-know^ote^^X 
nSff I 38 3 P ° SitiVe COntr ° l WMe no ^«nized (PBS or CFA/IFA adjuvant injected^ 

n^ce served as negative controls. Antibody titers were measured at days 35-38 by eSTusW 2 

Sr?Z r v ^ Pr ° teinS ' ^ ^ d6teimin6d to be - * e r -Seof 200.00-i y 00oS0 (eTpomt 

befo ^r T" n : yat,0n ' ^ miCC W6re ^' eCted With ^O- 300 ^ ™«* sera mtraperitoSy 
before intraperitoneal moculation with S. pneumoniae. Bacterid rnXn^- A ^ 

was determined by OD600nm measurements. In order to determine the real viable cell numbers 
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present in the inoculum prepared from the frozen glycerol stock cultures, cfus were determined via 
plating using six serial dilutions. 10«-10> bacteria/mouse was injected either intravenously in the tail vein 
or intraperitoneally. Protective effect of immunizations was measured by monitoring survival rates for 2 
to 3 weeks post-challenge and was expressed in % of total number of animals (10/ group). 



Restilts 



In the present invention six different pneumococcal antigens identified by bacterial surface display were 
determined to have protective effect in a mouse sepsis/lethality model. The best levels of protection were 
achieved by immunization with recombinant antigens representing the SP2190, SP2216 and SP0667 
protein, while SP0368, SP1732 and SP0498 displayed lower levels of protection (Fig. 8). The protective 
effect was mediated by antibodies as it has been demonstrated by passive serum transfer experiments 
(Fig. 9). Naive mice receiving specific anti-SP2190 and anti-SP2216 antibodies were protected from death 
relative to mice from the negative control groups. Importantly, the combination of these antigens resulted 
in improved protection, as it has been shown in Fig. 9. Passive immunization with 150 jd immune serum 
generated either with recombinant SP2190 or with recombinant SP2216 (supplemented with 150ul naive 
serum) resulted in lower survival rate compared to serum therapy with lOOul each of specific antisera 
(supplemented with lOOul naive serum). These experiments strongly support that combination of these 
antigens has beneficial effects in vaccination against pneumococcal diseases. 

Since the antigens used for immunization were derived from a serotype 4 strain and the challenge strain 
was a serotype 6B, these experiments established that the antigens were cross-protective. 

The SP2216, SP2190 and SP1732 recombinant proteins detected the highest levels of antibodies in sera of 
patients convalescing from invasive pneumococcal diseases, as well as in those of healthy individuals 
exposed to Pneumococcus (children in the household) (data not shown). The most frequently identified 
antigen m bactenal surface display screens was the SP2216 protein. It was a special interest to compare 
the protectivity of the subdomains of this protein selected (N-terminal amino acid sequences) or not 
selected (C-termmal amino acid sequences) by human antibodies (Fig. 10A). Upon immunization with the 
two different domains (expressed as recombinant antigens) it became evident that the immunogenic part 
of the SP2216 protein carried the protective potential, while the non-selected domain was ineffective and 
comparable to the negative control (Fig. 10B). Based on this experiment the epitopes detected by bacterial 
surface display identifies protective epitopes and regions of bacterial proteins and this information can be 
used for rational design of subunit vaccines based on the antigens described in the present invention. 

Example 8: Determination of sequence conservation of protective antigens 

Experimental procedures 

Immunoblotting 

J^t^ff r 783 ^ and , culture ^pernatant samples were prepared from in vitro grown S. pneumoniae 
strains. 60 clinical isolates) representing 48 different serotypes were included in the study 
cSTTT If ? S t0tal P rotein/lane was separated by SDS-PAGE using the BioRad Mini-Protean 3 
Cell electrophoresis system and proteins transferred to nitrocellulose membrane (ECL, Amersham 
Pharmacia). After overnight blocking in 5% milk, hyperimmune mouse sera generated by inmauni^tion 

Tomio^T* P^f 2216 ' SP1732 SP2190 (and SPOllT/PspA^ internal control) derived 
from serotype 4 strain were added at 5,000x dilution, and HRPO labeled anti-mouse IgG was used for 



DNA sequencing 
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?" fT*! °L T 32 ' SP219 ° SP22126 were ""PBfied from the genomic DNA of S. pneumoniae 
(serotype 6B, 9V, 14, 18C, 19F and23F) by PCR using a proofreading polymerase Expand (ROCHE) 
Gene specific primers, ranging between 27-31 bases in length, were used to amplify the entire open 
reading frames. The PCR products obtained were cloned into pCR®2.1-TOPO vector (Invitrogen) The 
recombinant plasmid DNA was purified using a QIAprep® miniprep kit (Qiagen) before the sequence 
was confirmed (MWG). In addition to the seven serotypes, the gene of SP2216 from other 41 different 
serotypes was amplified by PCR and the purified PCR products were sequenced. 



Results 



Identification of conserved antigens inducing antibodies that are cross-reactive with different clinical 
isolates is crucial for the development effective vaccines. It is especially relevant for protein-based 
vaccines targeting pneumococcal diseases, since more then 90 different serotypes oi Streptococcus 
pnuemonwe (Pneumococcus) have been associated with human infections. 

In a thorough analysis it was determined that the antibodies induced by SP2216, SP2190 and SP1732 all 
derived from a serotype 4 strain broadly cross-reacted with all the different serotypes tested in the 

— 1^ % U) - NOtaWy ' ** SP219 ° that showed variation* electrophoretic 

mobility (indicating different sizes) preserved the antibody reactivity strongly suggesting that 
mununodominant epitopes are conserved. In contrast, we detected lower cross-realty wSi antf-Psp^ 
antibodies that is in accordance with the known differences in immunogenic amino acid sequences of L 

hi order to directly address the question whether the identified protective antigens are conserved among 
tSfJ? Sei °y?Zt 1™°"°"™' DNA se ^ ence ^ysis was performed on the SP2216, SP1732 
rt 8eneS ' 16 ? 1732 ^ C ° nSerVed ' few ^ acid *»S» were detected, 

^•o^c T*?* 8 - S T from 47 different d ^ Elates representing 47 different S. pneumoniae 

ZTZ7™£l 6 : W6re det6Cted - d - 2 3f the analysed stains (Fig. 

12). hi the SP1732 gene one or two ammo acid exchanges were detected in the majority (in four of the six) 

1 cZ, J 13> - ^ 2190 ShOW6d 3 Variabmt y ^ aUo acid sequences" 

, "T " g Sen6S " ? " Sh ° Wn ^ 14 ' 1116 - d d ^nons makes it occult to 

calculate an exact ammo aad homology among the different SP2190 variants, but it can be estimated to be 
brtween approx 60 and 90%. However, the amino acid identity was sufficient to induce cross-reactive 
and cross-protective antibodies based on the experiments presented in Fig. 8,9 and 11 
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Table 1: Immunogenic proteins identified by bacterial surface display. 



S. pneumoniae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa** 


No. of 
selected 
clones per 
ORF and 

screen 


Location of 
identified 
immunogenic 
region (aa) 


Seq. 

ID 
(DNA, 
Prot) 


SP0G08 


hypothetical protein 


4-11,35^66-76,101-108,111-119 


G:15 


57-114 


1,145 


SP0032 


DNA polymerase I 
(polA) 


5-27,32-64,92-102,107-113,119-125,133-139,148- 

162,177-187,195-20l,207-214 > 241-251^54-269^85- 

300^02-309^17-324>332-357,365-404,411-425,443- 

463,470-477,479-487^06-512^15-520^32-547^56- 

596,603-610,616-622,624-629,636-642,646-665,667- 

674,687-692,708-720,734-739,752-757,798-820,824- 

851,856-865 


H^9,L*6, 


732-763 


2,146 


SP0069 


Choline binding 
protein I 


14-21^6-44 > 49-66,102-127,162-167,177-196 


G:1,H:Z 
1:1,1044, 
L:3, M:l 


45-109 
145-172 


3,147 


SP0071 


immunoglobulin Al 
protease (iga-1) 


17.35,64-75^1-92,100-119,125-172,174-183,214- 

222,230-236^73-282,287-303^10-315^31-340^92- 

398,412-420,480-505^15-523^25-546^53-575^92- 

598,603-609,617-625,631^639,644-651,658-670,681- 

687,691-704,709-716,731-736,739-744,750-763,774- 

780,784-791,799-805,809-822,859-870^80-885,907- 

916,924-941,943-949,973-986,1010-1016,1026- 

1036,1045-1054,1057-1062,1082-1088,1095-1102,1109- 

1120,1127-1134,1140-1146,1152-1159,1169-1179,1187- 

1196,1243-1251,1262-1273,1279-1292,1306-1312,1332- 

1343,1348-1364,1379-1390,1412-1420,1427-1436,1458- 

1468,1483-1503,1524-1549,1574-1588,1614-1619,1672- 

1685,1697-1707,1711-1720,1738-1753,1781-1787,1796- 

1801,1826-1843 


A:3,C:1, 
D:9,E:9, 
F:4, GOl, 
D34>K:61, 
L:20,M:2 


132-478 
508-592 
1753-1810 


4,148 


SP0082 


Cell wall surface 
anchor 


15-43,49-55,71-77,104-110,123-130,162-171,180- 
192,199-205,219-227,246-254>264-270^79-287,293- 
308312-322^30-342,349-356^69-377^84-394,401- 
406,416-422,432-439,450-460,464-474 > 482-494^01- 
508521-529336-546.553-558 Rft/USQi aco. 

612,616-626,634-646,653-660,673^81,688-698,705- 
710,720-726,736-749,833^48 


C.-9, E:4, 
FS, 1:26, 
L:4> M:67 


1-199 
200-337 
418-494 
549-647 


5,149 


SP0107 


LysM domain protein 


9-30,65-96,99-123,170-178 


A:3,B:16, 
C :15, D :1, 

F :178, ! 


1-128 


6,150 
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antigenic 
protein 


» x Ulauv C I ULLLV. LLUXl 

(by homoloev) 


predicted immunogenic aa** 


No. of 
selected 
clones pe 
ORFand 

screen 


Location of 
identified 
r immunogenic 
region (aa) 


Seq. 
ID 
(DNA, 
Prot) 








M:l 






SP0117 


pneumococcal surface 
protein A (pspA) 


7-32^4^1,96-106,127-136454-163488-199,207- 
238,272-279306-312318-325341-347353-360387- 
393399-406,434-440,452-503375-380389-601,615- 
o^^x»-o4U,&54-66O,674-680 / 696-701 / 710-731 


A:13, B:ll, 

C:10,D:4, 

E:31,F:6, 

G33, 

11:13,1:9, 

K:64, L:32, 

M:46 


1-548 
660-691 


7,151 


SP0191 


hypothetical protein 


i-19^5-44,48-59,77-87,93-99,106-111,130-138,146-161 


E:l, 1:2 


78-84 


8,152 


SP0197 


synthetase, putative 


^^u^Jb-43,64-86,93-99,106-130,132-145,148-165,171- 

177,189-220,230-249,251-263,293-300302-312323- 
329338-356.369^37Q ojaruAio 


L:9 


179-193 


9,153 


SP0212 


Ribosomal protein L2 


30-39,61-67,74-81,90-120,123-145,154-167,169-179,182- 
197,200-206,238-244,267-272 


L.10 


230-265 


10,154 


SP0222 


Ribosomal protein S14 


14-20,49-65,77-86 




2-68 


11,155 


SP0239 


Conserved 
hypothetical protein 


i-9,26-35,42-48^3-61,63-85,90-101,105-111,113- 

121,129-137,140-150,179-188,199-226,228-237,248- 

255,259-285^99-308^14-331^37-343,353-364,410- 
421,436-442 


L:2,M:1 


110-144 


12, 156 


SP0251 


formate 

«*v.c».jr i u, cu. u>icrdoc/ 

putative 


36-47^5-63,94-108,129-13^144-158,173-187,196- 

206^09-238^51-266,270-285,290-295,300-306^33- 

344346-354^66^97,404-410,422-435,439-453,466- 

^/JpiX>^^>zy-543p54-569371 -585^90-596,607- 

618,627-643,690-696,704-714,720-728,741-749,752- 
767,780-799 


G:2, H:7, 
1:1, MS 


225-247 
480-507 


13, 157 


SP0295 


[rpsl) 


io-^^o-/u,ou-yo,iuu-l06 


1:4 


78-130 


14, 158 


SP0330 


sugar binding 
transcriptional 
•egulatorRegR 


18-27,41-46,50-57,65-71,79-85,93-98,113-128,144- 

155,166-178,181-188,201-207,242-262^65-273,281- 

»5303^09318-327 


(3:1,1*1, 
L:4 


36-64 


15, 159 


SP0368 


:ell wall surface 
inchor family protein 

t 

7 
S 
1 


^-2931-44 / 50-59,91-96,146-153,194r201,207-212^32- 

^38,264-278,284-290,296-302326-353360-370378- 

^84,400-405,409-418,420-435,442-460,499-506329- 

»34356-562364-576,644-651,677-684>687-698,736- 

r 43,759-766,778-784308-814,852-858374-896,920- 

25,929^935,957-965,1003-1012,1021-1027,1030- 

04^1081-1087,1101-1111,1116-1124,1148-1159,1188- 


0:1, HS, 
fcl, L:l, 
V£3 


1-70 

154-189 

J22-941 

1445-1462 

L483-1496 


16, 160 



WO 2004/092209 



PCT/EP2004/003984 



-64- 



S. pneumoniae 
antigenic 
protein 



Putative function 
(by homology) 



predicted immunogenic aa** 



1196,1235-1251,1288-1303,1313-1319,1328-13354367- 

1373,1431-14374451-1458,1479-1503,1514-1521,1530- 

1540,1545-15524561-1568,1598-1605,1617-16474658- 

1665,1670-16764679-16894698-17044707.17134732 
1738,1744-1764 



No. of 
selected 
clones per 
ORF and 

screen 



Location of 
identified 
immunogenic 
region (aa) 



Seq. 
ID 
(DNA, 
Prot) 



Penicillin binding 
protein 1A 



6-5131-91404-113,126-137,150-159464-174,197- 
209,215-224^29-235,256-269,276-282^07-313^17- 
348^51^57^76-397,418-437,454-464,485-490,498- 
509347.555374-586,602-619 



B:l, E:l, 
L:13,M;1 



452-530 



17, 161 



hypothetical protein 



25-31^9-47,49-56,99-U4>121-127459-186,228-240,: 
269,271-279^03-315^65-382^95-405,414-425,438^53 



253- E:4> 1:1, 



L:3 



289-384 



18, 162 



SP0378 



Choline binding 
protein C 



9-24,41-47,49-54>68-78,108-114417-122432-140464- 

169,179-186,193-199,206-213,244-251,267-274^89- 

294,309-314*327^33 



G:5,H:4, 
r:l,K:88, 
L:3,M:8 



209-249 
286-336 



19,163 



choline binding 
protein J (cbpj) 



9-28^3^7,69-82,87^93409-117,172-177,201-207,220- 
227,242-247,262-268,305-318,320-325 



K:47,L:6, 
Mx5 



286-306 



20,164 



choline binding 
protein G (cbpG) 



4-10,26-39,47-58,63-73,86-96,98-108,115-123,137- 

143448-155,160-176,184-189,194-204,235-240,254- 

259,272-278 



G:l, K:69, 
M:6 



199-283 



21,165 



hypothetical protein 



4-2633^9,47-53^9-65,76-83,91-97404-112,118- 

137,155-160,167-174,198-207,242-268,273-279,292- 

315320-332,345-354>358-367,377-394,403-410,424- 

439,445-451^53-497311-518335-570373-589392- 

601,604-610 



H:1,I:1, 
L:6 



202-242 



22,166 



cell wall surface 
anchor family protein 



8-3036^5,64-71,76-82,97-103,105-112434-151,161- 

183,211-234,253-268,270-276,278-284,297-305309- 

315357^62366-37^375-384 > 401-407,409-416,441- 

455,463-470,475-480,490-497301-513324-537352- 

559365-576,581-590392-600,619-625,636-644,646^56 



A:1,B:2, 
C:4, E:l, 
F:4 



316-419 



23,167 



sortase, putative 



1732-58,84-99,102-110,114-120,124-135443-158,160 
173,177-196^01-216,223-250,259-267,269-275 



E:l, M:2 



1-67 



24,168 



Sortase, putative 



6-4637-67,69-80,82-133,137-143447-168,182-187,203- 
',214-229,233-242,246-280 



209, 



G:24, 

nao, ui 



53-93 



25,169 



endo-beta-N- 
acetylglucosaminidas 
putative 



4030-5631-89,117-123,202-209,213-218,223-229,248 
26U64-276,281-288303-3083l3-324326-332340- 
346353-372,434^43,465-474314-523356-564 > 605- 
616,620-626,631-636,667-683,685-699,710-719,726- 
732,751-756,760-771,779-788,815-828355-867,869- 



IB:5,C:1, 
E:2,F:1, 
!:2 



1226-1309 
1455-1536 
1538-1605 



26, 170 



WO 2004/092209 



PCT/EP2004/003984 



S. pneumonia 
antigenic 
protein 


e Putative function 
(by homology) 


pxcuicceu. immunogenic aa 


No. of 
selected 
clones pe 
ORF and 

screen 


Location of 
identified 
r immunogenic 
region (aa) 


Seq. 
ID 
(DNA, 
ProL) 






879^97-902,917-924,926-931,936-942,981-1000,1006- 
1015,1017-1028,1030-1039,1046-1054,1060-1066,1083- 

1092,1099-1112,1122-1130,1132-1140,1148-1158,1161- 

1171,1174-1181 1209-1930 1936.19/la fO/io ioc» 

/ *'^ J - AOJ -/ iZ4o- i2o*±. 1256- 

1267,1269-1276,1294-1299,1316-1328,1332-1354,1359- 
1372,1374-1380,1384-1390,1395-1408,1419-1425,1434- 
1446,1453-1460,1465-1471 1 474-1 /im iww ic-ie-ieoo 

1537,1547-1555,1560-1567,1577-1605,1633-1651 








SP0509 


type I restriction- 
modification system, 
M subunit 


4-10^1-39^1^88406-112,122-135,152-158,177-184,191- 
197,221-227.230-246 249-9==, 3fY*-Qi i Q17 on* 90 ? 

344^46-362^65-371,430-437,439-446,453-462,474-484 


hi 


449-467 


27, 171 


SP0519 


dnaj protein (dnaj) 


9-15,24^5,47^5,122-128,160-177,188-196^02-208,216- 
228,250-261,272^303318-324 397-33Q -ano ace 
361,368-373 


A:1,D:2, 
H:2 


108-218 
344-376 


28,172 


SP0529 


BlpC ABC transporter 
(blpB) 


6-14,17-48,55-63,71-90,99-109,116-124,181-189,212- 

223^2-268,270-294,297-304^19-325^40-348^51- 

370,372^78,388-394,406-415,421-434 


A:l, B:3, 
C3,D:1, 


177-277 


29, 173 


SP0564 


hypothetical protein 


21-39,42-61,65-75,79-85,108-115 




11-38 


30, 174 


SP0609 


amino acid ABC 
transporter, amino 
acid-binding protein 


4-17,26-39,61-76,103-113,115-122,136-142,158-192,197- 
203,208-214,225-230,237-251 


1:3 


207-225 


31,175 


SP0613 


metallo-beta- 
lactamase superfamily 
protein 


' ' ^v/^-'^O/Vtrt \J,/ { ±ryo f yD- 1U4> 1 14-1 19, 1 27- 1 

150,153-159,173-179,184-193,199-206,222-241,248- 

253,257-280,289-295^13-319^22-342^49-365^68- 

389^93-406,408^13,426-438,447-461,463-470,476- 
495,532-537,543-550 


1:12 


225-246 


32, 176 


SP0641 


Serine protease 

c 
] 
1 
1 
1 
1 


1-29,68-82,123-130,141-147,149-157,178-191,203- 

215,269-277^00-307^27-335^59-370^74-380^82- 

588^93^0,410^7,434-442,483-492,497-503^05- 

513^-540^64-569,601-607,639-647,655-666,693- 

706,712-718,726-736,752-758,763-m,774-780,786- 

^99,806-812^20-828,852-863,884-892,901-909,925- 

)32,943-948 / 990-996,1030-1036,1051-1059,1062- 

.068,1079-1086,1105-1113,1152-1162,1168-1179,1183- 

191,1204-1210,1234-1244,1286-1295,1318-1326,1396- 

401,1451-1460,1465-1474,1477-1483,1488-1494,1505- 

510,1514-1521,1552-1565,1593-1614,1664-1672,1677- 

685,1701-1711,1734-1745,1758-1770,1784-1798,1840- 


A:19, B:72, 
~:34> D:5, 
B:21,F:86, 
3:26, 

3:86, 1:17, 

-:130, 

vt-29 

1 

2 


1-348 

373-490 1 

573-767 

?03-1043 

L155-1198 

L243-1482 

L550-1595 

1682-1719 

793-1921 

.008-2110 


33,177 



WO 2004/092209 



PCT7EP2004/003984 



S. pneumonia, 
antigenic 
protein 



Putative function 
(by homology) 



-66- 



predicted immunogenic aa** 



No. of 
selected 
clones per 
ORJ? and 

screen 



Location of 
identified 
immunogenic 
region (aa) 



Seq. 

ID 
(DNA, 
Prot) 



18474852-18734885-18914906-19114931-19394957- 
19704977-1992,2014-2020,2026-2032,2116-2134 



SP0648 



beta-galactosidase 
(bgaA) 



10-35^9-52,107-112,181-188,226-236,238-253,258- 
268^75-284,296-310^26-338^45-368^80-389^91- 
408,410^118/420-429,444-456,489-505373-588,616- 
623,637-643,726-739,741-767,785-791,793-803,830- 
847,867-881,886-922,949-956,961-980,988-1004,1009 
1018,1027-1042,1051-1069,1076-1089,1108-1115,1123- 
1135,1140-1151,1164-1179,1182-1191,1210-12214223- 
1234,1242-1250,1255-1267,1281-1292,1301-1307,1315- 
1340,1348-1355,1366-1373,1381-14134417-1428,1437. 
1444,1453-1463,1478-1484,1490-14964498-1503,1520- 
1536,1538-1546,1548-1570,1593-1603,1612-16254635- 
16494654-1660,1670-1687,1693-17004705-1711,1718- 
1726,1729-1763,1790-18134871-18814893-1900,1907- 
1935,1962-1970,1992-2000,2006-2013,2033-2039,2045- 
2051355-2067,2070-2095,2097-2110^115-212U150- 
2171,2174-2180,2197-2202,2206-2228 



C:1,E:1, 
F:1,G:1, 
H:4,L1, 
M:2 



1526-1560 



34,178 



Zinc metalloprotease 
ZmpB, putative 



4-1735-48,54-76,78-107,109-115,118-127,134-140445- 

156,169-174,217-226,232-240,256-262,267-273^16- 

328,340-346^53-360,402-409,416-439,448-456,506- 

531^40-546370-578^86-593^95-600,623-632,662- 

667,674-681,689-705,713-724,730-740,757-763,773- 

778,783-796,829-835,861-871,888-899,907-939,941- 

955,957-969,986-1000,1022-1028,1036-1044,1068- 

1084,1095-11024118-1124,1140-1146,1148-1154,1168- 

1181,1185-11904197-1207,1218-1226,1250-1270,1272- 

12814284-1296,1312-1319,1351-1358,1383-14094422- 

1428,1438-1447,1449-1461,1482-14894504-1510,1518- 

15274529-1537,1544-1551,1569-15754622-1628,1631- 

16374682-1689,1711-1718,1733-1740,1772-1783,1818- 
1834,1859-1872 



A:9, B:25, 
C:13,D:7, 
8:14,11:77, 
G:12, 
H:10, 
K:67, L:13, 
M:6 



1-64 
128-495 



35,179 



pneumococcal s 
protein, putative 



surface 8-28,32-37, 



62-69,119-125,137-149,159-164473-189,200 
221-229,240-245,258-265,268-276^87-293^96- 
502,323-329 



205, 



A:72, B:80, 
C:90,D:20, 
:12,F:53 



-95 



36,180 



»-N- 9-18,2 
acetylmuramoylalanin 146, 



25-38,49-63,65-72,74-81,94-117,131-137,139- 
149-158,162-188,191-207,217-225,237-252^55- 
269,281-293^01^26,332-342347-354^63-370373- 



D3 



75-107 



37, 181 



WO 2004/092209 



PCT/EP2004/003984 



S. pneumonia 
antigenic 
protein 


e Putative function 
(by homology) 


- u / - 

predicted immunogenic aa** 


No. of 
selected 
clones pe 
ORF and 

screen 


i uucauon or 
I identified 
r| immunogenic 
I region (aa) 


oeq. 
in 

(DNA, 
Prot) 






380,391-400,41^424,441-447 








3P0749 


brandied-chain amin< 
acid ABC transporter 


3^24,64-71^1^7,96-116,121-128,130-139,148-155,166- 
173,176-184^03-215,231-238,243-248,256-261^80- 
286^88^06^14-329 


E:2, 1:8, 
L:8 


67-148 


38, 182 


SP0770 


ABC transporter, 
ATP-binding protein 


4.10,19-3746^2,62-81^3-89,115-120,134-139,141- 
151,168-186,197-205,209-234,241-252^22-335^39- 

345^63-379^85-393,403-431,434-442,447-454,459- 
465,479-484,487-496 


L:2 




39,183 


SP0785 


conserved 
hypothetical protein 


10-35,46-66,71-77^4-93,96-122,138-148,154,172,182- 

213,221-233^45-263,269-275,295-30W03^09^11- 

320,324-336,340-348,351-359,375-381 


C:l, E£, 
fcl 


1111-198 




SP0914 


nodulin-related 
protein, truncation 


14-25^0-4^47-61,67-75,81-91,98-106,114-122,124- 
135,148-193,209-227 


L.2 


198-213 


41,185 


SP0930 
SP0943 


choline binding 
protein E (cbpE) 

Gid protein (gid) — 


5-18,45-50,82-90,97-114,116-136,153-161,163-171^12- 
219^21-227^40-249,267-28Wll-317^28-337^75- 
381^90-395,430-436,449-455,484-495^38-543^48- 
55^556-564^80-586^96-602 


E:4, G:2, 
H:1,L2, 


£93-606 


42,186 




9-25,28-3^37-44,61-68,75-81,88-96,98-111,119-133,138- 
150,152-163,168-182,186-194,200-205,216-223^36- 
245^57-264^79-287,293-304,311-318^25-330^40- 
346,353^358,365-379,399-409,444-453 


E:2, L:24 


303-391 


43,187 


SP0952 


alanine 

dehydrogenase, 
authentic frameshift 
(aid) 


16-36,55-61,66-76,78-102,121-130,134-146,150-212,221- 
239,255-276,289^22^29-357 


G:3,H:4 


29-59 


44,188 


SP1003 


conserved 
hypothetical protein 
[PAT) 


8-27^8-74,77-99,110-116,124-141,171-177,202-217,221- 

228^59-265^75-290,293-303^09-325^35-343^45- 

351^65-379^84-394,406-414,423-437,452-465,478- 

507,525-534^54^60,611-624,628-651,669-682,742- 

'47,767-778,782-792,804-812,820-836 


£8,D:5, [ 

E:13,F:3, 

Vfc2 


559-451 


45, 189 


SP1004 


Conserved ~ 
lypothetical protein 

t 

6 
7 
8 
1 


J-28^9-45^6-62,67-74,77-99,110-117,124-141,168- 

L76,200-230,237-244,268-279,287-299^04-326^29- 

i35348-362370-376379-384>390-406,420-429,466- { 

l71,479-489,495^04 > 529-541^45^53361-577^98- E 

04,622-630,637-658^72-680,682-688^90-696,698- [ 

09,712-719,724-736,738-746,759-769,780-786,796- 

04 > 813-818^60-877,895-904,981-997,1000-10H1021- 
029 


*:5,B:4, h 
2:4,1*9, L 
*12,F:4, h 
-1:3,1:1, W 
-1 |8 


-162 
.06-224 

14-514 
64-938 


46, 190 



WO 2004/092209 



PCT7EP2004/003984 



-68 



S. pneumoniae 
antigenic 
protein 



Putative function 
(by homology) 



predicted immunogenic aa** 



No. of 
selected 
clones per 
ORF and 

screen 



Location of 
identified 
immunogenic 
region (aa) 



Seq. 
ID 
(DNA, 
Prot) 



SP1124 



glycogen synthase 
(glgA) 



4-11,19^9,56^6,68-101,109-116,123-145,156-165,177. 
185,204-221,226-23^242-248,251-256,259-265,282- 
302307-330340-349,355-374377-383392-400,422- 
428,434-442,462-474 



mi 



266-322 



47,191 



SP1154 



fgAl protease 



14-43,45-57,64-74^0-87,106-127,131-142,145-161,173- 
180,182-188,203-210,213-219,221-243,245-254^04- 
31Wl«20^42-348,354-365^72-378^94-399,407- 
431,436-448,459-465,470-477,484-490^04-509^31- 
537^90-596,611-617,642-647,723-734,740-751,754- 
762,764-774,782-797,807^812^24-831,838-845,877- 
885^92-898,900-906,924-935,940-946,982-996,1006- 
1016,1033-1043,1051-1056,1058-1066,1094-1108,1119- 
1126,1129-1140,1150-1157,1167-1174,1176-1185,1188- 
1201,1209-1216,1220-1228,1231-1237,1243-1248,1253- 
1285,1288-1297,1299-1307,1316-1334,1336-1343,1350- 
1359,1365-1381,1390-1396,1412-1420,1427-1439,1452- 
1459,1477-1484,1493-1512,1554-1559,1570-1578,1603- 
1608,1623-1630,1654-1659,1672-1680,1689-1696,1705- 
1711,1721-1738,1752-1757,1773-1780,1817-1829,1844- 
1851,1856-1863,1883-1895,1950-1958,1974-1990 



A:6, B:2, 
C:9,D:3, 
E:4> F:2, 
G:6,H:4> 
1:13, L:12 



172-354 
384-448 
464-644 
648-728 
1357-1370 



48,192 



SP1174 



conserved domain 
protein (PAT) 



8-27,68-74,77-99,110-116,124-141,169-176,201-216,: 
227,258-264,274-289,292-302^08-324^34-342^44- 
350^64-372377-387^99-407,416-429,445-458,471- 

481,483-500^18^27^47-553,604-617,621-644,662- 
675,767-778,809-816 



,220- B; 



:14,C:17, 
D:6, E:18, 
F:16,I:1, 
K:5, Ul, 
M:8 



15-307 

350-448 

496-620 



49,193 



SP1175 



conserved domain 
protein 



4-17,24-2933-59,62-84,109-126,159-164,189-204,208- 
219,244-249^74-290,292-302308-324334^42344- 
350378-389391^97,401-409,424-432,447-460,470- 
479,490-504321-529338-544349-555370-577383- 
592,602-608,615-630,635-647,664-677,692-698,722- 
731,733-751,782-790,793-799 



C:3,D^, 
:9,F^ 
P£2,M:4 



56-267 

337-426 

495-601 



50, 194 



SP1221 



type II restriction 
endonuclease 



12-22,49-59,77-89,111-121,136-148,177-186,207- 
213,217-225,227-253,259-274,296-302328-333343- 
354374-383,424-446,448-457,468-480,488-502307- 
522344-550353-560364-572387-596,604-614,619- 
625,629-635,638-656,662-676,680-692,697-713,720- 
738,779-786333-847361-869380-895397-902,911- 
917,946-951,959-967,984-990,992-1004,1021-1040,1057. 



G:2, H:l, 
K:1,L:4 



381-403 



51, 195 
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1067,1073-1080 








5P1227 


DNA-binding 
resoonse regulator 


4-10,26-31,46-56,60-66,70-79^6-94,96-102,109-118432 


E:l, U3 


81-149 


52,196 


SP1241 


amino acid ABC 
transporter, amino 

acid-hinrfirto" nrn 


i-21,26-37,48^0,71-82,109-117,120-128,130-136,142- 
147,181-187,203-211,216-223,247-255,257-284^16- 
3*3/o / o-o /ypyo-4UU,423-435,448-456,479-489^12- 
576,596-625,641-678,680-688,692-715 


B2, CI, 
ErZLl 


346-453 


53,197 


SP1287 


signal recognition 
particle protein (£fh) 


10-16,25-31,34-56^8-69,71-89,94-110,133-176,186- 
193^08-225,240-250,259-266^02-^07^35-34W76- 

0OO, t kX\j"*kXX> 


B :8, G :8, 
H:3,M:1 


316-407 


54,198 


SP1330 


N- 

acetylmannosamine- 
6-P Goimera.^p 
putative (nanE) 


11-29,42-56,60-75^2-88,95-110,116-126,132-143,145- 
160,166-172,184-216 


L:45 


123-164 


55, 199 


SP1374 


Chorismate sythetase 
(aroO 


11-29,5^63,110-117,139-152,158-166,172-180,186- 
x^o,^j.3-^o,Z4U-zoi^Uz-3Z3^30-335^40-347^50- 
366,374-381 


G:1,L:29, 


252-299 


56,200 


SP1378 


hypothetical protein 


iD-z//50-4z,t>u-J>b,67-74,112-136,141-153,163-171 / 176- 

189,205-213,225-234,241-247,253-258,269-281,288- 

298,306-324,326^34,355-369,380.387 


H:2 


289-320 


57, 201 


5P1429 




/-15,iy-41^6-72,91-112,114-122,139-147,163-183,196- 
209,258-280;326^8,357-363,391-403,406-416 


H:4 


360-378 


58,202 


SP1478 


oxidoreductase, 
aldo/keto reductase 
family 


11-18,29-41,43-49,95-108,142-194,204-212,216-242,247- 
256,264-273 


H:ll 


136-149 


59,203 


SP1518 


ciu u>ci v c(i 

hypothetical protein 


18-24>33-40,65-79 / 89-102,113-119,130-137,155-161 / 173- 

179,183-203,205-219,223-231,245-261^67-274,296- 

306^11^21^3(^341,344-363^69-381,401-408,415- 

4.0*7 AV7-A A A /f CQ ACA A*l*\ atto Act a rnn — _ _ . 

*z/,^3/-444,4W-464,472-478,484-508^17-524^26- 
532,543-548 


A:10, E:4, 
G:5,H:1 


59-180 


60,204 


5P1522 


conserved domain 
protein 


5-13,52-65,67-73,97-110,112-119,134-155 


B:4, C:6, 
E:l, H.-7, 
L:3 


45-177 


61, 205 


SP1527 


Dligopeptide ABC 
transporter 


S-28^4^3^7-67,75-81,111-128,132-147,155-163,165- 
176,184-194,208-216,218-229,239-252,271-278^28- 
^,363^76,381-388,426-473,481-488,492-498,507- 
513,536-546 / 564-58£590-601,607-623 


A-l, B:l, 
C:4,F:1, 
G:26, 

Htl8, 1:10, 
tO, Mil 


148-269 
120-450 
310-648 
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SP1573 


lysozyme (lytQ 


4-12,20-38^9-75,83-88,123-128445-152,154-161483- 
188,200-213,245-250,266-272^06-312^32^39^57- 

369,383-389 ^0=^4/19 /icc^m-i Aorr xzm 


A:40, B:27 
C:24, D:2, 
E:6, G:ll, 
K:l 


, 1-112 


63,207 


SP1604 






A:l, G3, 
E:1,G:1, 
fcl 


70-154 


64, 208 


5P1661 


cell division protein 
DivIVA 


26-42,54-61,65-75,101-107,123-130,137-144,148- 
156,164-172 1 77-1 Q9 910^901 oai oeq 


E:2 


157-249 


65,209 


SP1664 


yhariF protein (ylmF) 


29-38,61-67,77-87,94-100,105-111,118-158 


B:l, C:42, 
13 


1-97 


66,210 


SP1676 


M-acetylneuraminate 
lyase, putative 


7-21,30-48^1-58,60-85,94-123,134-156,160-167,169- 
183,186-191^16-229,237-251,257-267,272-282,287-298 


Ha 


220-243 


67,211 


SP1687 


(nanB) 


^^4-4/^6-65,69-76,83-90,123-134,143-151,158- 
178,197-203,217-235,243-263^03-309^20-333^38- 
348^67-373^87-393,407-414,416-427,441-457,473- 
482,487-499^01^09^14-520^30-535^77-583^90- 
602,605-612,622-629,641-670,678-690 


B:3, E:2, 
L:1,M£ 


37-71 
238-307 


68,212 


SP1693 


neuraminidase A 
(nanA) 


7-40,121-132,148-161,196-202,209-215,221-235,248- 

0 0*71 OOH OOO OOP nor» aaa n/M- . _ _ _ _ 

^,z/l-2«O / 288-295^30-339^95-409,414-420,446- 
451^75-487^56-563,568-575,580-586388-595,633- 
638,643-648,652-659,672-685,695-700,710-716,737- 
742,749-754,761-767,775-781,796-806^23-835,850- 
863,884-890,892-900,902-915,934-941 


C:3, D:5, 
E:3,F:1, 
G:7, H:l, 
r:3,K^0, 
L:4 


406-521 


69, 213 


SP1732 


protein kinase 


y-l»,24-4t>>l-bb,o7-77^5-108,114-126,129-137,139- 

146,152-165,173-182,188-195,197-204,217-250,260- 

274,296-313^43^66,368^84,427-434,437-446,449- 

XFR 4.7ft— ASKA. AQn KCtc coo enn e^^> r-r>-« r-i-^ 

*^/*'o-*o^,^2-5Uo,522-527 / 562-591,599-606,609- 
518,625-631,645-652 


E:2,H:1 


577-654 ; 


70, 214 


SP1735 


nethionyl-tRNA 
brrnyltransferase 
£mt) 


13-20,26-37,41-53^6-65^1-100,102-114,118-127,163- 
188,196-202,231-238,245-252,266-285^93-298^01-306 


K:13,M:13 


19-78 


71,215 


SP1759 j 
t 

s 


ireprotein 
ranslocase, SecA 
ubunit (secA-2) 

4 
5 
7 


10-23^2-42,54-66,73-91,106-113,118-127,139-152,164- 

173,198-207,210-245,284-300^13-318^30-337^39- 

W6^54-361^87-393,404-426,429-439,441-453,467- 

t73,479-485,496-509336-544 > 551-558360-566369- 

74^78-588,610-615,627^635,649-675,679-690^98- 

16,722-734,743-754,769-780,782-787 


:6,L;2, 
v*2 


180-550 ~~ 


72, 216 
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SP1772 


cell wall surface 
anchor family protein 


6-39,42-50,60-68,76-83,114-129,147-162,170-189,197- 

205,217-231^39-248,299-305^38-344352-357^71- 

377^80-451,459-483,491-499^07-523^37-559^87- 

613,625-681,689-729,737-781,785-809,817-865,873- 

881^89-939,951-975,983-1027,1031-1055,1063- 

1071,1079-1099,1103-1127,1151-1185,1197-1261,1269- 

1309,1317-1333,1341-1349,1357-1465,1469-1513,1517- 

1553,1557-1629,1637-1669,1677-1701,1709-1725,1733- 

1795,1823-1849,1861-1925,1933-1973,1981-2025,2029- 

2053,2061-2109,2117-2125,2133-2183,2195-2219,2227- 

2271,2275-2299^307-2315,2323-2343^347-2371^395- 

2429,2441-2529,2537-2569,2577-2601,2609-2625,2633- 

2695,2699-2737,2765-2791,2803-2867,2889-2913,2921- 

2937,2945-2969,2977-2985,2993-30093023^3045,3073- 

30993111-31673175-32153223-32673271-32953303- 

3351,3359-33673375-34253437-34613469-35133517- 

35413549-35573565^5853589-36133637-36713683- 

37473755-37953803-38193827-38353843-39513955- 

3999,4003-4039,4043-4115,4123-4143,4147-4171,4195- 

4229,4241-4305,4313-4353,4361-4377,4385-4393,4401- 

4509,4513-4557,4561-4597,4601-4718,4749-4768 


B:9,C:1, 
D:1,F:13, 
Gil, H:3, 
hi, L:l, 


74-171 

452-559 

2951-3061 


73,217 


OT>1 on A 


general stress protein 
24, putative 


16-22,30-51,70-111,117-130,137-150,171-178,180- 
188,191-196 


D4 


148-181 


74,218 


SP1888 


oligopeptide ABC 
transporter, ATP- 
binding protein AmiE 


6-19,21-463(^56,80-86,118-126,167-186,189-205^11- 
242,244-267,273-286,290-297307-316320-341 


H:l 


34-60 


75, 219 


SP1891 


oligopeptide ABC 
transporter, 


5-2633-43,48-5438-63,78-83,113-120,122-128,143- 
152,157-175,185-192,211-225,227-234,244-256,270- 
281^84-290304^10330-337348-355362-379384- 
394,429-445,450^74,483-490311-520337-546348- 
554361-586390-604,613-629 


A:2,B:3, 
E:1,F:1, 
G:13, H:8 


149-186 
285-431 
573-659 


76,220 


SP1937 


Autolysin (lytA) 


5-26,49^9,61-6733-91,102-111,145-157,185-192,267- 
272,279-286,292-298306-312 


D:3,F:1, 
G:1,H:2, 
It'll 

IV. XX, J.YJL. X 


134-220 
235-251 


77,221 


SP1954 


serine protease, 
subtilase family, 
authentic frame 


5-19,72-7933-92,119-124,140-145,160-165,167-182,224- 
232,240-252,259-270301-310313-322332-343 347- 
367384-398,416-429,431-446,454-461 


C:43, E:6, 
L:50 


1-169 


78,222 


SP1980 


cmp-bmding-factor 1 


8-17,26-3136^2,75-83,93-103,125-131,135-141,150- 


H:9 


127-168 


79,223 
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(cbfl) 


194,205-217,233-258,262-268,281-286 








SP1992 


anchor family protein 




B:5, C:l, 
F:4,I:1 


46-161 


80,224 


SP1999 


catabolite control 
protein A (ccpA) 


6-13,18-27^9-48,51-59,66-73,79-85,95-101,109-116,118- 
124 144-164 166-177 1 Q^-^nA 01 c ooi 007 

236,242-249,252-259,261-270,289-301318-325 


1:2 


12-58 


81,225 


SP2021 


glycosyl hydrolase 


4-10^6-32^60,97-105,117-132,138-163,169-185,192- 
214,219-231,249-261,264-270,292-308343-356385- 
392398-404 408-417 4AR-441 


L:3 


24-50 


82,226 


SP2027 


Conserved 
hypothetical protein 


10-40,42-48^1-61,119-126 


A:l, E:l, 

G:19, 

1*12*1:16, 

LilD 


1-118 


83,227 


SP2039 


conserved 

hypothetical protein 


5-17^0-58,71-83,103-111,123-140,167-177,188-204 


G:l,t:3 


116-128 


84,228 


SP2048 


Conserved 
hypothetical protein 


4-9,11-5037-70,112-123,127-138 


fcl,L:4 


64-107 


85,229 


SP2051 


Conpetence protein 


9-3931-67 


D:1,G:3, 
1:8, L:26 


1-101 


86,230 


SP2092 


UlJP-glucose-1- 
phosphate 
uridylyltransferase 
(galU) 


5-14,17-25^8-46^2-59^5-93,99-104111-120,122- 
131,140-148,158-179,187-197^04-225,271-283,285-293 




139-155 


87,231 


SP2099 


protein IB 


^-/u,/>^U,yz-lU»,112-127,152-164,166-172,181- 
199,201-210,219-228,247-274,295-302322-334336- 
346353-358396-414419-425,432-438,462-471^18- 
523331^536,561-567 S76-RAQ £9fi imi h/lc 

671,697-710,718-731,736-756,765-771,784-801 


Al, B.-9, 
Oil, D:l, 
E:6,F:1, 
H:4, K*l 


626-653 


88,232 


5P2108 


Maltose ABC 
transporter 


8-28,41-51,53-62,68-74,79-85,94-100,102-108,114- 
120,130-154,156-162.175-180 198-204. 9nfi-9iQ 9ai 

294308-318321-339362-368381-386393-399,407-415 


G:10,H:1, 
L:iu, M:l 


2-13 


89,233 


SP2120 


hypothetical protein 


4-39,48-65,93-98,106-11^116-129 


fc2 


10-36 


90,234 


SP2128 


transketolase, N- 
terminal subunit 


25-3235-50^6-71,75-86,90-96,123-136,141-151,160- 
179,190-196,209-215,222-228,235-242,257-263^70-280 


H2 


209-247 


91,235 


SP2136 


choline binding 
protein PcpA 


5-2931-38^0-57,62-7533-110,115-132,168-195,197- 
206^16-242,249-258,262-269333-340342-350363- 
368376^92,400-406^10-421,423-430,436-442,448- 
454,460-466,471-476,491-496311-516331-536351- 


C3,F:1, 
G:24, 
rfc32,L13, 
&177, 


27-70 
219-293 
441-504 
512-584 


92,236 
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556,571-576,585-591399-605 


L:34,M:18 






SP2141 


related protein 


d— 19 1/1— 34 /17 'TIS QQ IfM 1IY7 tic lOO i An i ao inr -t *>»~ 

*t- Aii, / -/ ;>,oc>- 1U4, 1U/-1 15,133-140,148-185,187- 

196^07-212,224-256,258-265,281-287,289-296^98- 

308325-333345-355365-371382-395,424-435,441- 

457,465-47^483-491,493-505,528-534336-546352- 

558375-584389-600,616-623 


L:3 


576-591 


93,237 


SP2180 


conserved 

hypothetical protein 


4-76, 78-89, 91-126, 142-148, 151-191, 195-208, 
211-223, 226-240, 256-277. 279-285, 290-314. 317- 
323. 358-377. 381-387, 391-396, 398-411, 415-434, 
436-446, 454-484, 494-512, 516-523, 538-552, 559- 
566, 571-577, 579-596, 599-615, 620-627. 635-644. 
694-707, 720-734, 737-759, 761-771 




313-329 


94, 238 


SP2190 


choline binding 
protein A (cbpA) 


7-38,44-49,79-89,99-108,117-123,125-132,137-146,178- 
187,207-237,245-255322-337365-387^98-408,445- 


A:6, B:12, 

C:9,D:6, 

E:30, F:8, 

G:65, 

H:72,I:76, 

K.-222, 

L:99,M:37 


111-566 


95,239 


SP2194 


protease, ATP- 
binding subunit 


0-^,^^-^^^-43,50-04,/ /- 117, 137-144,158-163,205- 

210,218-224,229-236,239-251,263-277^99-307323- 

334353-384388-396399-438,443-448,458-463,467- 

478,481-495303-509,511-526,559-576395-600,612- 

645,711-721,723-738,744-758,778-807 


H:l 


686-720 


96,240 


SP2201 


protein D (cbpD) 


x\j^>o / 3D-&i,/4i-o% izy- 138,158-163,203-226,243- 
252.258-264 77Q- r3 XYy < W9J¥>Q'iai 'Xftc* am .acic aia /toe 


B:4,C:3, 
D:l, E:7, 
F:1,G:1, 
H£, K:26, 
Ml 


184-385 


97,241 


SP2204 


ribosomal protein L9 


4-9,19-24,41-47,75-85,105-110,113-146 


£fc3,L:4 


^5-62 


98,242 


SP2216 


secreted 45 kd protein 
- homology to glucan 
binding protein 
(GbpB) S.mutant 


4-2532-67,117-124,131-146,173-180,182-191,195- 

206^15-221,229-236,245-252,258-279,286-291,293- 

302314-320327-336341-353355-361383-389 


A;130, 

r5:414, 

C:450, 

D:162, 

E:166, 

F:284, 

G:90, 

H:16, 1*4/ 

K:10,L:29, 


1-285 


99,243 
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M:ll 






SP-NRF1 


Choline binding 
protein 


14-32J38-50 73-84 Q3-1 iftQ ft A 


H:l 


40-70 


100, 244 














ARF0408 


Hypothetical protein 


5-26 


r «3 




101, 245 


ARF0441 


Hypothetical protein 


23-28 


rx.j> 


11 OQ 

l»3-o9 


102, 246 


ARF0690 


Hypothetical protein 


8-14 


L:2 


21-34 


103,247 


AJRF0878 


Hypothetical protein 


4-13,20-29,44-50,59-74 




41-69 


104,248 


ARF0921 


Hypothetical protein 


4-9,19-42, 48-59,71-83 


£VL4 


57-91 


105, 249 


ARF1153 




4-14 


M:7 


10-28 


106, 250 


AKF1515 


Hypothetical protein 


£-*r*.o,o4r*±L,K&-j 1,01-11 1, I4y-loo, i5o-l67,172-180,182- 
203,219-229 


G:4, H:5 


27-49 


107,251 


ARF1519 1 


Hypothetical protein 


17-27 


rjr.Q 


23-32 


108, 252 


ARF1905 


Hypothetical protein 


18-24 


rj.O 

Hl2 


28-38 


109,253 


ARF2044 


Hypothetical protein 


9-15 


tl:o 


13-27 


110, 254 


ARF2155 


Hypothetical protein 


13-22 


ruo 


18-29 


111, 255 


ARB2199 


Hypothetical protein 


17-26 




2-11 


112, 256 


CRF0129 




4-33 


L:4 


16-32 


113,257 


CRF0200 


Hypothetical protein 


4-1037-43.54-84 92-127 


H:5, L:l 


15-62 


114,258 


CRF0236 


Hypothetical protein 


4-14,20-3235-60 69-75 70-90 lfYumo 1 1 A lAft 


L:3 


124-136 


115, 259 


CRF0394 




none 


H;7 


2-13 


116, 260 


CRF0408 


Hypothetical protein 


4-13,28-42 


L:ll 


42-57 


117, 261 


CRF0430 


Hypothetical protein 


4-1427-44 


f'.vl tl.O 

u:4, rto 


14-35 


118, 262 


CRF0498 


Hypothetical protein 


4-12 


ri:4 


1-27 


119, 263 


CKF0519 


Hypothetical protein 


4-18.39-45,47-74 


G:5, H:3 


35-66 


120,264 


CRF0573 


Hypothetical protein 


8-20,43^77 — , 


L\?, L:9 


17-36 


121, 265 


CRF0713 


Hypothetical protein 


4-3035-45,51-57 


r 


35-49 


122, 266 


CRF0722 


Hypothetical protein 


4-24,49-57 


U.lo 


15-34 


123, 267 


CRF0764 


Hypothetical protein 


4-22 


LI4 


8-27 


124, 268 


CRF1079 


Hypothetical protein 


13-2532-59,66-80 


n.o 


21-55 


125, 269 


CRF1248 


Hypothetical protein 


4-10,2433 35-42.54-65 72-82 98-108 


11:1 


15-30 


126, 270 


CBF1398 


Hypothetical protein 


8-19 


Hrl, L:3 


17-47 


127,271 


CRF1412 


Hypothetical protein 


12-18,40-46 


L:8 


31-52 


128,272 


CRF1467 


Hypothetical protein 


4-2035-78,83-102,109-122 


fc4 


74^6 


129, 273 


CRF1484 


Hypothetical protein 


7-17,21-41,46-63 


1-5 


2-20 


130, 274 


CRF1587 


Hypothetical protein 


30-37 


G:3, H:3, 


2-33 


131, 275 
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Sm pj tCtttTlOtlT&t 

antigenic 
protein 


i uumve luxiludu 
(bv homoloev) 


predicted immunogenic aa** 


No. of 
selected 
clones per 

<TRF anrl 

V-/XV-L (Ulll 

screen 


Location of 
identified 
immunogenic 
region xuSL} 


Seq. 
ID 
(DNA, 
riot,) 








L-4 






CRF1606 


Hypothetical protein 


4-13,17-25 


L:3 


1-15 


IT? 97£ 


CRF1623 


Hypothetical protein 


17-31,44-51 


M*6 


20-51 


too 077 


CRF1625 


Hypothetical protein 


20-30 


L10 


5-23 




CKF1640 


Hypothetical protein 


13-33,48-71 


15 


92-110 




CRF1702 


Hypothetical protein 


4-9,50-69,76-88,96-106,113-118 


L:6 


12-34 


136 980 


CRF1825 


Hypothetical protein 


4-24 


L:ll 


6-26 


T37 9ft1 


CRF1883 


Hypothetical protein 


7-26 


H:61, L:77 


14-30 


138, 282 


CRF1991 


Hypothetical protein 


9-39,46-68,75-82,84-103 


H*6 T »9 




139, 283 


CRP1992 


Hypothetical protein 


4-30,33-107 


LVi./ 


CO OA 


140, 284 


CRF2004 


Hypothetical protein 


4-12 


L>.0 


O ESI 


141, 285 


CRF2030 


Hypothetical protein 


12-18,29-37 


ri:5,L:l, 
M:l 


6-37 


142,286 


CRF2065 


Hypothetical protein 


4-21,33-52,64-71 


LI, M:6 


16-37 


143, 287 


CRP2232 


Hypothetical protein 


9-19 


L:3 


2-30 


144,288 




1 


— _ 
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Table 2. Immunogenicity of epitopes in peptide ELISA 
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Table 3: Gene distribution in S. pneumoniae strains. 



PCTYEP2004/003984 



ORF 



Common name 



Gene distribution 
(present of 50) 



Amino acid 
substitutions (in 
serotype 14 strain)" 



Homology 0 



Seq. 
ID (DNA, 
Prot) 



SP0008 



hypothetical protein 



n.d. 



n.d. 



1, 145 



SP0032 



DNA polymerase I (polA) 



n.d. 



n.d. 



2,146 



5P0069 



Choline binding protein I 



0/166S 



3,147 



SP0071 



SP0082 



SP0107 



SP0117 



immunoglobulin Al protease (iga-1) 



Cell wall surface anchor 



50 



LysM domain protein 



50 



pneumococcal surface protein A 
(pspA) 



rud. 



0/477# 



5/385 



1/173 



n.d. 



4,148 



5, 149 



6,150 



7,151 



hypothetical protein 



n.d. 



n.d. 



8,152 



dihydrofolate synthetase, putative 



n.d. 



n.d. 



9,153 



Ribosomal protein L2 



50 



0/232 



10,154 



Ribosomal protein S14 



n.d. 



n.d. 



11,155 



Conserved hypothetical protein 



n.d. 



n.d. 



12, 156 



formate acetyltransferase, putative 



n.d. 



n.d. 



13,157 



ribosomal protein S9 (rpsl) 



50 



1/121 



14, 158 



sugar binding transcriptional 
regulator RegR 



rtd. 



n.d. 



15, 159 



cell wall surface anchor family 
protein 



46 



4/422# 



16,160 



Penicillin binding protein 1A 



50 



1/346 



17, 161 



hypothetical protein 



n.d. 



n.d. 



18, 162 



Choline binding protein C 



29 



0/114 



19,163 



choline binding protein J (cbpj) 



50 



2/104 



20,164 



choline binding protein G (cbpG) 



50 



3/171# 



21, 165 



hypothetical protein 



48 



l/303# 



22,166 



cell wall surface anchor family 
protein 



10 



0/298# 



23,167 
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Pfim m nn n,imp 

\-UUUUUil lUUtlC 


ijene custriDuuon 
(present of 50) 


Amino acid 
substitutions (in 
serotype strains 


Homology 0 


Seq. 
ID (DNA, 

XT0L) 


SP0609 


amino acid ABC transporter, amino 
acid-binding pro 

or 


kJVJ 






31, 1/5 


SP0613 


metallo-beta-lactamase superfamily 
protein 


n.d. 


n H 




I/O 


SPG641 


Serine protease 


rud. 


n #-1 




*30 i r 7 r 7 
OO, 1// 


SP0648 


beta-galactosidase (bgaA) 


50 


0/304 




34,178 


SP0664 


Zinc me tailor) rotpacp ZmoB nnfaHvo 


n.a. 


n.d. 




35, 179 


SP0667 


pneumococcal surface protein, 
putative 


45 


18/297 




36,180 


SP0688 


UDP-N-acetylmuramoylalanine~D- 
glutamate ligase 


n.d. 


n.d. 




37, 181 


SP0749 


branched -chain amino arirl AKf 
transporter 


en 


4/303 




38, 182 


SP0770 


ABC transporter, A'IP-binding 
protein 


50 






39, 183 


SP0785 


conserved hypothetical protein 


50 


0/304 




40,184 


SP0914 


nodulin-related protein, truncation 


n.d. 


rua. 




41, 185 


5P0930 


choline binding protein E (cbpE) 


47 


1 '7/OQ/L 
X/fZ.zt*k 




42, 186 


SP0943 


Gid protein (gid) 


Il.U. 


— J 

n.a. 




43, 187 


SP0952 


alanine dehydrogenase, authentic 
frameshift (aid) 


' n rl 

XI. U, 


_ j 
n.a. 




44, 188 


SP1003 


conserved hypothetical protein (PAT) 


n.d. 


n r! 

IL.U. 




45, 189 


SP1004 


Conserved hypothetical protein 


rud. 


IL.ll. 




4b, 190 


SP1124 


glycogen synthase (glgA) 


n.d. 


n H 

II. vU 




4/, 191 


SP1154 


IgAl protease 


28 


1 ^M7n* fiflmiccincr 

i.&f±/\jf ouiiiisoirig 




48, 192 


SP1174 


conserved domain protein (PAT) 


n.d. 


ad. 




49, 193 


SP1175 


conserved domain protein 


n H 


n.a. 




50, 194 


5P1221 


type II restriction endonuclease 


n.d. 


n.d. 




51, 195 


SP1227 


DNA-bindinff rpsnoncp rpoiilafrvr 


n.a. 


n.d. 




52, 196 


SP1241 


amino acid ABC transporter, amino 
acid-binding protein 


50 


0/285 




53,197 


SP1287 


(ffti) 




0/300 




54,198 


SP1330 


N-acetylmarmosamine-6-P 
epimerase, putative (nanE) 


14 






55, 199 


SP1374 


Chorismate sythetase (aroC) 


50 


0/289 




56, 200 


SP1378 


conserved hypothetical protein 


n.d. 1 


n.d. 




57, 201 


SP1429 


peptidase, U32 family 


50 


8/305 




58,202 


SP1478 


oxidoreductase, aldo/keto reductase 


n.d. 


ilcL 




59,203 
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ORF 


Common name 


Gene distribution 
(present of 50) 


Amino acid 
substitutions (in 
serotype 14 strain)* 


xiuuiuiugy V/ 


oeq. 
Prot) 




family 










SP1518 


conserved hypothetical protein 


50 


4/313; 3 additional 




60,204 


SP1522 


conserved domain protein 


n.d. 


n.d. 




61, 205 


SP1527 


oligopeptide ABC transporter 


50 


0/463 




62,206 


SP1573 


lysozyme (lytQ 


n.d. 


n.d. 




63, 207 


SP1604 


hypothetical protein 


50 


3/138 




64,208 


SP1661 


cell division protein DivIVA 


50 


3/236 




£5 ?ftQ 

DO, ZU7 


SP1664 


ylmF protein (ylmF) 


50 


0/164 




OD, Z1U 


5P1676 


N-acetylneuraminate lyase, putative 


n.d. 


n.d. 




67, 211 


SP1687 


neuraminidase B (nanB) " 


n.cL 


n.d. 




68, 212 


SP1693 


neuraminidase A (nanA) 


n.d. 


n.d. 




69,213 


SP1732 


serine/threonine protein kinase 


49 






70, 214 


SP1735 


methionyl-tRNA formyl transferase 
(rmt) 


n.d. 


n.d. 




71, 215 


SP1759 


preprotein translocase, SecA subunit 
(secA-2) 


rud. 






72,216 


SP1772 


cell wall surface anchor family 
protein 








73, 217 


SP1804 


general stress protein 24, putative 


rud. 


n.d. 




74, 218 


SP1888 


oligopeptide ABC transporter, ATP- 
b in ding protein AmiE 


rud. 


n.d. 




75,219 


SP1891 


oHgopeptide ABC transporter, 


n.d. 


n.d. 




76, 220 


SP1937 


Autolysin (lytA) 


50 


0/275 




77,221 


5P1954 


serine protease, subtilase family, 
authentic frame 


12 


U/oUSfF 




78,222 


5P1980 


cmp-bmding-factor 1 (cbfl) 1 


rud. 


n.u. 




79,223 


SP1992 


cell wall surface anchor family 
protein 


50 






80,224 


SP1999 


catabolite control protein A (ccpA) 


n.d. 


n.d. 




Q1 OOC 
OJL, JLZO 


SP2021 


glycosyl hydrolase — 


n.d. 


n.d. 






SP2027 


Conserved hypothetical protein 


rud. 


n.d. 




83,227 


SP2039 


conserved hypothetical protein 


rud. 


n.d. 




84, 228 


SP2048 


Conserved hypothetical protein 


50 


8/1 34 




85,229 


SP2051 


Conpetence protein CglC ~~ 


50 


Q/QO 




86,230 


SP2092 


UTP-glucose-l-phosphate 
uridylyltransferase (galU) 


n.d. 


n.d. 




87, 231 


SP2099 


Penecillin binding protein IB 


n.d. 


n.d. 




88,232 


SP2108 


Maltose ABC transporter 


50 


1/279 




89,233 


srzizu hypothetical protein 


n.d. 


n.d. 




90,234 
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ORP 


Common name 


Cpup riiafriVnif-inn 

(present of SO) 


/uuino aUU 
substitutions (in 
serotype 14 strain)* 


nomoiogy u 


oeq. 
m fDTSIA 

11/ 

Prot) 


SP2128 


transketolase, N-terminal subunit 


n.d. 


rud. 




91,235 


SP2136 


choline binding protein PcpA 


45 


1/382 




92,236 


SP2141 


glycosyl hydrolase-related protein 


n.d. 


rud. 




93, 237 


SP2180 


conserved hypothetical protein 


n-d. 


n.d. 




94, 238 


SP2190 


choline binding protein A (cbp A) 


47 


for: 48,8%; rev: 2 /17# 




95, 239 


SP2194 


ATP-dependent dp protease, ATP- 
binding subunit 


50 


1/262 




96,240 


5P2201 


choline binding protein D (cbpD) 


50 






97, 241 


SP2204 


ribosomal protein L9 


IbUi 


n.a. 




98,242 


SP2216 


secreted 45 kd protein - homology to 
glucan binding protein (GbpB) 
5. mutant 


50 






99,243 


SP-NRF1 


Choline binding protein 


n.d. 


n.d. 




100, 244 














ARF0408 


Hypothetical protein 


n.d. 


n.d. 






ARF0441 


Hypothetical protein 


n.d. 


n.d. 




1fl9 OAA 
lUZ, Z*tO 


ARF0690 


Hypothetical protein 


n.d. 


rud. 




±UJ, Z4/ 


ARF0878 


Hypothetical protein 


n.d. 


rud. 






ARF0921 


Hypothetical protein 


n.d. 


rud. 




inc 54.0 


ARF1153 


Hypothetical protein 


n.d. 


n.d. 






ARF1515 


Hypothetical protein 


n.d. 


n.d. 




107, 251 


ARF1519 


Hypothetical protein 


n.d. 


n.d. 




lflfi 259 


ARF1905 


Hypothetical protein 


n.d. 


n.d. 




IftQ 9^ 


ARF2044 


Hypothetical protein 


n.d. 


n.d. 




iin 9ca 


ARF2155 


Hypothetical protein 


rud. 


n.d. 




111 OKK 
LLL, ZDO 


ARF2199 


Hypothetical protein 


n.d. 


n.d. 




llz, Zoo 


CRF0129 


Hypothetical protein 


n.d. 


n.d. 




llO, ZD/ 


CRF0200 


Hypothetical protein 


n.d. 


n.d. 




11A 9*;ft 
LL% ZOO 


CRF0236 


Hypothetical protein 


iud. 


n.d. 




11C OEQ 
J.XO, Z37 


CRF0394 


Hypothetical protein 


rud. 


n.d. 




116 9^n 

no, zou 


CRF0408 


Hypothetical protein 


n.d. 


n.d. 




117 9A1 
XX/, zoi 


CRF0430 


Hypothetical protein 


rud. 


n.d. 




11ft 9A9 
XXO, ZOZ 


CRF0498 


hypothetical protein 


n.d. 


n.d. 




11Q 9AQ 
1X7, ZOO 


CRF0519 


Hypothetical protein 


n.d. 


n.d. 




120,264 


CRF0573 


Hypothetical protein 


rud. 


rud. 




121,265 


CRF0713 


Hypothetical protein — 


rud. 


rud. 




122,266 


CRF0722 


Hypothetical protein 


rud. 


n.d. 




123,267 


CKF0764 


Hypothetical protein — — — 


rud. 


rud. ! 




124, 268 
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Common name 


Gene distribution 
ipresent or ouj 


Amino add 
substitutions (in 

sprnhmp 14. strain^* 


Homology 0 


Seq. 

1L/ IIJIMA, 

Prot.) 


CRF1079 


Hypothetical protein 


rud. 


n.d. 




125, 269 


CRF1248 


Hypothetical protein 


rud. 


rud. 




126, 270 


CRF1398 


Hypothetical protein 


rud. 


n .d. 




127, 271 


CRF1412 


Hypothetical protein 


rud. 


n.d. 




128, 272 


CRF1467.1 


Hypothetical protein 


n.d. 


n .d. 




129, 273 


CRF1484 


Hypothetical protein 


rud. 


rud. 




ion *yjA 


CRF1587 


Hypothetical protein 


n.d. 


n.d. 




J.0 1, z/o 


CRF1606 


Hypothetical protein 


rud. 


n.d. 




1 V) OVA, 
1«2Z, Z/O 


CRF1623 


Hypothetical protein 


n.d. 


n.d. 




I.OO, z/ / 


CRF1625 


Hypothetical protein 


n.d. 


n.d. 




lOrt, Z/O 


CRF1640 


Hypothetical protein 


n.d. 


n.d. 




loD, Z/7 


CRF1702 


Hypothetical protein 


rud. 


rud. 




loo, ZoU 


CRP1825 


Hypothetical protein 


n.d. 


n A 




137, 2ol 


CRF1883 


Hypothetical protein 


n.d. 


n A 

n. a* 




-f An Ann 

138, 282 


CRF1991 


Hypothetical protein 


iud. 


n H 

ii. a. . 




139, 283 


CRF1992 


Hypothetical protein 


n.d. 


n A 

rua.. 




140, 284 


CRF2004 


Hypothetical protein 


rud. 


rud. 




141, 285 


CRF2030 


Hypothetical protein 


n.d. 


n.d. 




142, 286 


CRF2065 


Hypothetical protein 


rud. 


rud. 




143,287 


CRF2232 


Hypothetical protein 


rud. 


n.d. 




144>288 















I 
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Table 4. 



ORF 


Common Name 


FACS 


PK 


ARF0878 


hypothetical protein 


+ 


nd 


ARF0921 


hypothetical protein 


+ 


nd 


CRF0236 


hypothetical protein 


++ 




CRF0573 


hypothetical protein 


+ 


nd 


CRF1412 


hypothetical protein 


4 


nd 


CRF1702 


hypothetical protein 


+ 


1 IU 


CRF1992 


hypothetical protein 


++ 


++ 


SP0008 


hypothetical protein 


+ 




SP0069 


Choline binding protein I 


++ 




SP0082 


Cell wall surface anchor 


+ 




SP0117 


pneumococcal surface protein A (pspA) 


+++ 


| | | 


SP0212 


Ribosomal protein L2 


+ 


++ 


SP0295 


ribosomal protein S9 (rpsl) 


++ 


+++ 


SP0368 


cell wan surface anchor family protein 


++ 


+++ 


SP0369 


Penecillin binding protein 1A 


++ 


++ 


SP0377 


Choline binding protein C 


++ 


++ 


SP0378 


choline binding protein J (cbpJ) 


++ 


nri 


SP0390 


choline binding protein G (cbpG) 


++ 


4. 


SP0454 


hypothetical protein 


++ 


+++ 


SP0463 


cell wall surface anchor family protein 


4. 


++ 


SP0466 


sortase, putative 


++ 


++ 


SP0468 


Sortase, putative 


++ 


++ 


SP0519 


dnaJ protein (dnaJ) 


44 


+ 


SP0609 


amino add ABC transporter, amino acid-bind 


44 


+ 


SP0641 


Serine protease 


4 




SP0664 


Zinc metalloprotease ZmpB 


4 




SP0749 


branched-chain amino acid ABC transporter 


4. 


T 


SP0770 


ABC transporter, ATP-binding protein 


++ 


++ 


SP1154 


IgAl protease 


++ 




SP1287 


signai recognition particle protein (ffh) 


+ 


++ 


SP1330 


N-acetylmannoseamine-6-P 


++ 




SP1429 


peptidase, U32 family 


+ 


4*4 


SP1527 


oligopeptide ABC transporter 


4. 


Alia 


SP1759 


preprotein translocase, SecA subunit (wrong done!!!) 


4. 




SP1772 


cell wail surface anchor family protein 


4. 


4, 


SP1891 


oligopeptide ABC transporter 


+ 


++ 


SP1937 


Autoiysin (lytA) 


+ 




SP1954 


senne protease, subtilase family, auth frame 


+ 


++ 


SP1980 


cmp-binding-factor 1 (cbfl) 


+ 




SP2108 


Maltose ABC transporter — ' 


+ 


++ 


SP2136 


choline binding protein PcpA 


+ 


++ 


SP2190 


choline binding protein A (cbpA) 


+ 


++ 


SP2194 


ATP-dependent Clp protease, ATP-bind subu 


++ 


++ 


SP2201 
SP2216 


choline binding protein D (cbpD) 
secreted 45 kd protein 


+ 
+ 


++ 
++ 



